Train Models Contrastively in Pytorch
-
Updated
Mar 26, 2025 - Python
Train Models Contrastively in Pytorch
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
Think-on-Graph 3.0: Efficient and Adaptive LLM Reasoning on Heterogeneous Graphs via Multi-Agent Dual-Evolving Context Retrieval
A sample app for the Multimodal Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power Q&A experiences.
Production inference for encoder models - ColBERT, GLiNER, ColPali, embeddings etc. - as vLLM plugins for online and in-process deployment
[NAACL 2024] Official Implementation of paper "Self-Adaptive Sampling for Efficient Video Question Answering on Image--Text Models"
🧠 Multimodal Retrieval-Augmented Generation that "weaves" together text and images seamlessly. 🪡
🚀 HAG: Next-Gen AI | Neo4j + Weaviate Fusion | Dual-Similarity Retrieval | 100% Local & Private | Graph Intelligence Meets Vector Search
🔰 A Comprehensive RAG repository covering basic vanilla RAG techniques, advanced retrieval methods, hybrid search fusion approaches, hands-on reranking techniques with code + explanation 📚✨
Self-adaptive Planning Agent。自适应规划代理的多模态检索增强生成技术。
High-performance late-interaction retrieval engine for on-prem AI. ColBERT/ColPali multi-vector search with Rust fused MaxSim, Triton GPU kernels, ROQ quantization, LEMUR routing, WAL-backed CRUD, and a FastAPI server — single machine, CPU or GPU.
Anaya is a Content Engine that specializes in analyzing and comparing multiple PDF documents. It uses Retrieval Augmented Generation (RAG) techniques to effectively retrieve, assess, and generate insights from the documents.
📄 Multimodal RAG pipeline combining ColPALI visual retrieval, YOLO-DocLayNet layout detection, sentence embedding-based text retrieval, and LLaMA-4 completion for document question answering.
Repository for team Devs
LumiCite is a multimodal RAG system for academic papers, designed for multimodal evidence retrieval and citation-aware question answering.
Multimodal RAG and comparisons between language models. (Project for Deep Learning Module at the FHSWF)
A comprehensive Multimodal Retrieval-Augmented Generation (RAG) application that combines FastAPI backend with Streamlit frontend, supporting multiple AI models, advanced OCR capabilities, and intelligent document processing.
Vision transformer-powered knowledge extraction. Analyze any image: botanical taxonomy, cultural landmarks, object semantics. Generates adaptive study resources via generative AI.
SamvidAI — Enterprise Contract Intelligence powered by OpticalRAG Multimodal document understanding system for clause extraction, legal risk scoring, and explainable contract analysis using layout-aware RAG pipelines.
Add a description, image, and links to the multimodal-rag topic page so that developers can more easily learn about it.
To associate your repository with the multimodal-rag topic, visit your repo's landing page and select "manage topics."