<10ms: Embedding Inference + Retrieval
125K+: Package Installs
100%: Offline Indexing + Querying
Trusted by 1000+ teams
How It Works
01. Push your data source (docs, knowledge bases, or live data) via our SDK or portal.
02. We index, sync, and distribute a compact index wherever your agent runs: browser, edge, device, or cloud.
03. Your agent retrieves context locally, in under 10ms. No hops. No lag. No infra to manage.
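The local retrieval in step 03 can be sketched as a query over a compact in-memory index scored by cosine similarity. This is a minimal TypeScript illustration only: the `IndexedDoc` shape, the `retrieve` helper, and the toy vectors are assumptions for the sketch, not the actual Moss index format or API.

```typescript
// Illustrative sketch: a compact local index queried by cosine similarity.
// In practice, vectors would come from an embedding model; here they are toy values.
type IndexedDoc = { id: string; vector: number[] }

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i]
    na += a[i] * a[i]
    nb += b[i] * b[i]
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb))
}

// Return the ids of the top-k documents closest to the query vector.
function retrieve(index: IndexedDoc[], query: number[], k: number): string[] {
  return index
    .map(doc => ({ id: doc.id, score: cosine(doc.vector, query) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map(r => r.id)
}

const index: IndexedDoc[] = [
  { id: 'shipping', vector: [0.9, 0.1, 0.0] },
  { id: 'returns',  vector: [0.1, 0.9, 0.0] },
  { id: 'billing',  vector: [0.0, 0.1, 0.9] },
]

console.log(retrieve(index, [0.8, 0.2, 0.0], 1)) // [ 'shipping' ]
```

Because the index lives in the same process as the agent, a lookup like this is a pure in-memory scan with no network hop, which is what makes sub-10ms retrieval feasible.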
Integration
Install Moss and start querying in just a few lines of code
import { MossClient, DocumentInfo } from '@inferedge/moss'

const client = new MossClient(
  process.env.MOSS_PROJECT_ID!,
  process.env.MOSS_PROJECT_KEY!
)

const documents: DocumentInfo[] = [
  {
    id: 'doc1',
    text: 'How do I track my order? Log into your account.',
    metadata: { category: 'shipping' }
  }
]

Benchmarks
Benchmarked on 100k documents. Latency includes embedding inference and end-to-end cloud roundtrip. View benchmark script
Use Cases
If you're building voice AI, copilots, or multimodal apps where retrieval is on your critical path, Moss is built for you.
Sub-10ms context retrieval for real-time conversation. Your agent recalls, reasons, and responds without the pause.
FAQ
Everything you need to know about Moss and real-time semantic search for AI agents.