20/04/2026 16:47:29
Rolling Out Data Quality Overnight, without losing the plot: A Multi-Agent System for Speech Data Quality Management
Rishabh Kumar, Abhinav Painuli, Chriss Philip Saji, Devesh Soni, Amrith Krishna, Ganesh Ramakrishnan
20/04/2026 16:56:00
Rolling Out Data Quality Overnight, without losing the plot: A Multi-Agent System for Speech Data Quality Management
Rishabh Kumar, Abhinav Painuli, Chriss Philip Saji, Devesh Soni, Amrith Krishna, Ganesh Ramakrishnan
20/04/2026 23:18:34
Tables Decoded: DELTA for Structure, TARQA for Understanding
Jahanvi Rajput, Dhruv Kudale, Saikiran Kasturi, Utkarsh Verma, Ganesh Ramakrishnan
20/04/2026 23:26:37
LexGen: Domain-aware Multilingual Lexicon Generation
Ayush Maheshwari, Atul Kumar Singh, Karthika NJ, Krishnakant Bhatt, Preethi Jyothi, Ganesh Ramakrishnan
20/04/2026 23:30:39
Unified Wisdom: Harnessing Collaborative Learning to Improve Efficacy of Knowledge Distillation
Durga S, Atharva Abhijit Tambat, Ganesh Ramakrishnan, Pradeep Shenoy
20/04/2026 23:33:11
LEVOS: Leveraging Vocabulary Overlap with Sanskrit to Generate Technical Lexicons in Indian Languages
Karthika N J, Krishnakant Bhatt, Ganesh Ramakrishnan, Preethi Jyothi
20/04/2026 23:37:11
ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification
Yaswanth M, Vaibhav Singh, Ayush Maheshwari, Amrith Krishna, Ganesh Ramakrishnan
20/04/2026 23:39:59
Linguistically informed automatic speech recognition in Sanskrit
Rishabh Kumar, Devaraja Adiga, Rishav Ranjan, Amrith Krishna, Ganesh Ramakrishnan, Pawan Goyal, Preethi Jyothi
20/04/2026 23:43:00
Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
Raavi Gupta, Pranav Hari Panicker, Sumit Bhatia, Ganesh Ramakrishnan
20/04/2026 23:48:44
Beyond Common Words: Enhancing ASR Cross-Lingual Proper Noun Recognition Using Large Language Models
Rishabh Kumar, Sabyasachi Ghosh, Ganesh Ramakrishnan
21/04/2026 00:18:27
TACTFUL: A Framework for Targeted Active Learning for Document Analysis
Venkatapathy Subramanian, Sagar Poudel, Parag Chaudhuri, Ganesh Ramakrishnan
21/04/2026 00:20:27
Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration
Piyush Singh Pasi, Karthikeya Battepati, Preethi Jyothi, Ganesh Ramakrishnan, Tanmay Mahapatra, Manoj Singh
21/04/2026 00:21:58
WARM: A Weakly (+Semi) Supervised Math Word Problem Solver
Isha Pandey, Oishik Chatterjee, Aashish Waikar, Vishwajeet Kumar, Ganesh Ramakrishnan
20/04/2026 16:47:29
Rolling Out Data Quality Overnight, without losing the plot: A Multi-Agent System for Speech Data Quality Management
Rishabh Kumar, Abhinav Painuli, Chriss Philip Saji, Devesh Soni, Amrith Krishna, Ganesh Ramakrishnan
20/04/2026 16:56:00
Rolling Out Data Quality Overnight, without losing the plot: A Multi-Agent System for Speech Data Quality Management
Rishabh Kumar, Abhinav Painuli, Chriss Philip Saji, Devesh Soni, Amrith Krishna, Ganesh Ramakrishnan
Gradient Coreset for Federated Learning
Durga Sivasubramanian, Lokesh Nagalapatti, Rishabh Iyer, Ganesh Ramakrishnan
In Proceedings of The 12th IEEE Winter Conference on Applications of Computer Vision (WACV 2024)
SPEAR : Semi-supervised Data Programming in Python
Guttu Sai Abhishek, Harshad Ingole, Parth Laturia, Vineeth Dorna, Ayush Maheshwari, Rishabh Iyer, Ganesh Ramakrishnan
In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi (Demo paper)