Query Expension for Better Query Embedding using LLMs
-
Updated
Feb 18, 2025 - Python
Query Expension for Better Query Embedding using LLMs
Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + training pipeline + technical report.
The repo provides the code for Qdrant for efficient image indexing and retrieval using models such as ColPali, ColQwen, and VDR-2B-Multi-V1, jina embeddings v4 etc enhancing multimodal search capabilities across various applications.
Self-hosted MCP server for hybrid semantic code search and repository intelligence.
Intelligent arXiv paper discovery engine with hybrid BM25+vector search and agentic RAG: fetch, index, and interrogate AI research papers using a lightweight state-machine pipeline, OpenSearch, and local LLMs.
A Model Context Protocol (MCP) server that provides semantic search over AWS Cloudscape Design System documentation.
Docs.AI RAG Chatbot is an advanced application designed to revolutionize document interactions through AI-driven capabilities.
A real-time news chatbot application built with modern web technologies. It delivers intelligent, AI-powered responses, supports multiple chat sessions with persistent history, and provides a responsive, user-friendly interface across devices.
A Node.js REST API that powers a RAG-based chatbot, handling data ingestion, vector search, and LLM-powered responses.
End-to-End Python implementation of Beck et al.'s (2025) economic sentiment analysis framework for constructing a high-frequency economic sentiment indicator using 1024-dimensional Jina embeddings and LLM-generated training data. Features L2-regularized classification and rigorous POOS econometric validation with DM-HAC tests for GDP forecasting.
A complete web data Retrieval-Augmented Generation (RAG) pipeline built with TypeScript and Bun that scrapes news articles using Selenium, embeds them with Jina's cloud embeddings API, and stores semantic vectors in Qdrant vector database for fast similarity search and AI-powered applications.
A decentralized protocol for authenticating AI-generated content — blending blockchain proofs, IPFS storage, and semantic AI embeddings to establish verifiable authorship trails.
Backend for a RAG-powered news chatbot providing real-time AI responses, semantic search, and news retrieval using Node.js, Socket.IO, PostgreSQL, Redis, and Qdrant.
Oh My Repos: Semantic search for GitHub starred repositories.
A full-stack chatbot that answers queries over recent news using Retrieval-Augmented Generation (RAG).
Sec2ndBrain transforms your digital life into a chat-ready RAG knowledge base. Powered by Groq and Jina AI, you can instantly talk to your saved notes, YouTube videos, and Tweets.
🌐 Enable seamless semantic search over AWS Cloudscape documentation for AI agents and coding assistants with this efficient MCP server.
Advanced High-Fidelity RAG pipeline featuring Agentic Hierarchy Parsing for structural correction and Context-Sensitive Late Chunking for preserving document context.
Demonstration of integration of Nvidia GPUs with Elasticsearch
🛠 Reconstruct original text from text embeddings using conditional masked diffusion to reveal reversible embedding representations efficiently and accurately
Add a description, image, and links to the jina-embeddings topic page so that developers can more easily learn about it.
To associate your repository with the jina-embeddings topic, visit your repo's landing page and select "manage topics."