CAT FieldIQ

🏁 Inspiration

Field inspections are essential but time-consuming. Operators juggle photos, notes, paperwork, part lookups, and site planning—all while working in fast-moving, high-risk environments.
We wanted to imagine what would happen if CAT Inspect and CAT AI Assistant were fused into one intelligent, mobile-first companion that transforms raw field input into instant, actionable insight.
That idea became CAT FieldIQ.

🚜 What it does

CAT FieldIQ is a multimodal AI assistant for field operations.
From a phone, operators can:

📸 Take a machine photo → get a full PASS/FAIL/MONITOR inspection report
🎤 Speak notes → convert to structured inspection logs
🔧 Snap a part → receive ranked part numbers with fitment certainty
🗺️ Enter site details → generate multiple optimized layout plans with ASCII diagrams
📝 Create a unified report combining all results

It’s fast, structured, operator-friendly, and works entirely in the browser.

🏗️ How we built it

Next.js 14 (App Router) for a lightweight mobile-first interface
Tailwind CSS with a CAT-themed palette for a native feel
OpenAI GPT-4o for:
- Vision-based inspections
- Part identification
- Site layout reasoning
- Report summarization
Next.js API routes to structure JSON outputs and enforce safety
Client-side React components for smooth mobile workflows, image previews, and result cards

Our focus was clarity and reliability: simple screens, minimal text, strong visual hierarchy, and single-tap actions.

⚠️ Challenges we ran into

Ensuring GPT-4o returned strict JSON for inspection and parts data
Designing a UI that felt native on mobile, not like a shrunk web form
Getting layout plans to produce readable, consistent ASCII diagrams
Handling image upload constraints across browsers
Managing multiple multimodal flows (vision, voice, text) under hackathon time pressure

🏆 Accomplishments we're proud of

A fully working, multimodal AI assistant built end-to-end
Clean mobile interface designed for real CAT field workflows
Structured AI reports that genuinely feel operator-ready
Reliable parts identification with ranked fitment confidence
Generating multiple site layout plans with clear reasoning
Creating an experience that feels like a next-generation CAT product

📚 What we learned

How to orchestrate multimodal prompting for real-world tasks
The importance of operator-first UI design in industrial environments
How CAT inspections work, including PASS/FAIL/MONITOR patterns
Rapidly building integrated Next.js + GPT-4o systems
Designing for speed, clarity, and low cognitive load

🚀 What's next for CAT FieldIQ

Integrating real CAT parts catalogs and service manuals
On-device photo capture enhancements and AR overlays
Fleet-level dashboards for managers and inspectors
Offline-first mode for remote job sites
Auto-generated maintenance schedules across multiple machines
Expanding FieldIQ into a full platform for CAT operators worldwide

Built With

figma
next.js
openai
tailwind

Updates

William Jiang started this project — Mar 01, 2026 02:26 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.