virattt/rag-reranking-gpt-colbert.ipynb

Last active October 30, 2025 02:24

Star (31) You must be signed in to star a gist
Fork (8) You must be signed in to fork a gist

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/virattt/b140fb4bf549b6125d53aa153dc53be6.js"></script>
Save virattt/b140fb4bf549b6125d53aa153dc53be6 to your computer and use it in GitHub Desktop.

Download ZIP

rag-reranking-gpt-colbert.ipynb

Raw

rag-reranking-gpt-colbert.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

truebit commented Jan 23, 2024

@Psancs05 thx

Author

virattt commented Jan 23, 2024

Great catch - updated 🙏

jsancs commented Jan 23, 2024

@virattt Do you know the difference between using:
query_embedding = model(**query_encoding).last_hidden_state.squeeze(0)
query_embedding = model(**query_encoding).last_hidden_state.mean(dim=1)

I have tested both and seems that the squeeze(0) returns better quality similar documents (maybe it's just the use-case I tried)

TripleExclam commented Jan 30, 2024

query_embedding = model(**query_encoding).last_hidden_state.squeeze(0) is correct since it returns a vector per token, whilst
query_embedding = model(**query_encoding).last_hidden_state.mean(dim=1) returns a single vector averaged over all tokens.

virattt/rag-reranking-gpt-colbert.ipynb

truebit commented Jan 23, 2024

Uh oh!

virattt commented Jan 23, 2024

Uh oh!

jsancs commented Jan 23, 2024

Uh oh!

TripleExclam commented Jan 30, 2024

Uh oh!