🏁 Inspiration

Field inspections are essential but time-consuming. Operators juggle photos, notes, paperwork, part lookups, and site planningβ€”all while working in fast-moving, high-risk environments.
We wanted to imagine what would happen if CAT Inspect and CAT AI Assistant were fused into one intelligent, mobile-first companion that transforms raw field input into instant, actionable insight.
That idea became CAT FieldIQ.

🚜 What it does

CAT FieldIQ is a multimodal AI assistant for field operations.
From a phone, operators can:

  • πŸ“Έ Take a machine photo β†’ get a full PASS/FAIL/MONITOR inspection report
  • 🎀 Speak notes β†’ convert to structured inspection logs
  • πŸ”§ Snap a part β†’ receive ranked part numbers with fitment certainty
  • πŸ—ΊοΈ Enter site details β†’ generate multiple optimized layout plans with ASCII diagrams
  • πŸ“ Create a unified report combining all results

It’s fast, structured, operator-friendly, and works entirely in the browser.

πŸ—οΈ How we built it

  • Next.js 14 (App Router) for a lightweight mobile-first interface
  • Tailwind CSS with a CAT-themed palette for a native feel
  • OpenAI GPT-4o for:
    • Vision-based inspections
    • Part identification
    • Site layout reasoning
    • Report summarization
  • Next.js API routes to structure JSON outputs and enforce safety
  • Client-side React components for smooth mobile workflows, image previews, and result cards

Our focus was clarity and reliability: simple screens, minimal text, strong visual hierarchy, and single-tap actions.

⚠️ Challenges we ran into

  • Ensuring GPT-4o returned strict JSON for inspection and parts data
  • Designing a UI that felt native on mobile, not like a shrunk web form
  • Getting layout plans to produce readable, consistent ASCII diagrams
  • Handling image upload constraints across browsers
  • Managing multiple multimodal flows (vision, voice, text) under hackathon time pressure

πŸ† Accomplishments we're proud of

  • A fully working, multimodal AI assistant built end-to-end
  • Clean mobile interface designed for real CAT field workflows
  • Structured AI reports that genuinely feel operator-ready
  • Reliable parts identification with ranked fitment confidence
  • Generating multiple site layout plans with clear reasoning
  • Creating an experience that feels like a next-generation CAT product

πŸ“š What we learned

  • How to orchestrate multimodal prompting for real-world tasks
  • The importance of operator-first UI design in industrial environments
  • How CAT inspections work, including PASS/FAIL/MONITOR patterns
  • Rapidly building integrated Next.js + GPT-4o systems
  • Designing for speed, clarity, and low cognitive load

πŸš€ What's next for CAT FieldIQ

  • Integrating real CAT parts catalogs and service manuals
  • On-device photo capture enhancements and AR overlays
  • Fleet-level dashboards for managers and inspectors
  • Offline-first mode for remote job sites
  • Auto-generated maintenance schedules across multiple machines
  • Expanding FieldIQ into a full platform for CAT operators worldwide

Built With

  • figma
  • next.js
  • openai
  • tailwind
Share this project:

Updates