Cross-provider AI code review for Claude Code — evidence-based confidence scoring with Codex, Gemini & Claude
-
Updated
Apr 16, 2026 - Shell
Cross-provider AI code review for Claude Code — evidence-based confidence scoring with Codex, Gemini & Claude
The extraction API that shows its work. Product data extraction with per-field confidence scoring and extraction provenance. REST API + MCP server. 50 free calls/month.
Open-source LLM evaluation engine with statistical confidence scoring
Multi-agent AI task delegation architecture for n8n: orchestrator routes natural-language commands to specialist agents with confidence scoring and human-in-the-loop gates.
Zero-Noise utilities for safer product research and review signal analysis.
Research-grade Self-Correcting RAG agent built with LangGraph that retrieves knowledge, generates answers, evaluates grounding/relevance/completeness, and iteratively self-improves with confidence scoring and memory.
Extract structured data from any document — PDF, DOCX, HTML, CSV, plain text — using LLMs with Pydantic schema validation, per-field confidence scores, and source grounding.
System that aggregates outputs from multiple Large Language Models (GPT-4, Claude-3, custom models) to generate reliable, high-confidence results through consensus-based reasoning evaluation. Demonstrates sophisticated AI orchestration with 92.7% accuracy improvement over single-model.
AI-powered problem solver using dual-AI validation with 88%+ confidence scoring. By Yourox.ai
Smart Document Conversion for the AI Era - CPU-only, fast, with confidence scoring. Converts PDF, DOCX, PPTX, HTML, EPUB to Markdown, JSON, HTML, Text.
Backend document processing pipeline using n8n and Gemini AI. Receives files via webhook, extracts structured data, calculates confidence scores and stores results in Supabase and Google Sheets.
MFGC confidence scoring and safety gates for AI agents. Zero dependencies.
A modular AI-driven pipeline for cleaning, normalizing, and standardizing large-scale inventory data with automated SKU generation, confidence scoring, and human-in-the-loop validation.
Knowledge ingestion system with artifact lineage, replayable stages, and append-only persistence.
Add a description, image, and links to the confidence-scoring topic page so that developers can more easily learn about it.
To associate your repository with the confidence-scoring topic, visit your repo's landing page and select "manage topics."