Lex GPT

AI-powered search for Lex Fridman podcast.

This is a testbed for exploring Langchain functionality.

Dataset

Start with episode transcriptions from Whisper via @karpathy for first 325 episodes:

https://karpathy.ai/lexicap/index.html

Text splitting and OpenAI embeddings done via Langchain in scripts/get_data.ipynb.

Store embeddings in Pinecone.

Search

Use Langchain VectorDBQAChain to embed the user query and perform similarity search on Pinecone embeddings. Synthesize the answer from relevant chunks with ChatGPT. The relevant chunks with metadata (links) are displayed as source documents in the UI.

UI

This build on the excellent: https://github.com/mckaywrigley/wait-but-why-gpt

Credits

Thanks to Mckay Wrigley for his work on the UI and app design.

Of course, thanks for Lex Fridman for the excellent podcast and Karapthy for the Whisper transcriptions.

Contact

If you have any questions, feel free to reach out to me on Twitter!

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
components		components
pages		pages
public		public
scripts		scripts
styles		styles
types		types
README.md		README.md
license		license
next-env.d.ts		next-env.d.ts
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lex GPT

Dataset

Search

UI

Credits

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Lex GPT

Dataset

Search

UI

Credits

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages