Auto-generate embeddings on entity create/update by EmanueleDeRossi1 · Pull Request #1639 · rhesis-ai/rhesis

EmanueleDeRossi1 · 2026-04-14T13:45:40Z

Purpose

Automatic embedding generation is triggered by SQLAlchemy ORM instance listeners on EmbeddableMixin (after_insert / after_update). Callers no longer need to manually enqueue embedding work after saving an entity. The pipeline passes identity + precomputed searchable text into EmbeddingService, the generator, and the Celery task, so background work does not depend on holding the ORM instance or re-loading it just to get text.

What changed

EmbeddableMixin (mixins.py)

Still provides searchable_text_changed(), which hashes to_searchable_text() and compares it to Embedding.text_hash rows (first embed, or stale when no row matches the current hash).
New: @event.listens_for(EmbeddableMixin, "after_insert", propagate=True) and "after_update" handlers build EmbeddingService(session) and call enqueue_embedding(...) with entity_type, entity_id, searchable_text, user_id, and organization_id.

EmbeddingService / generator / task

enqueue_embedding is keyword-based and passes primitives (plus searchable_text) instead of (entity, current_user).
EmbeddingGenerator.generate accepts optional searchable_text; when set, it can embed without loading the entity for text.
generate_embedding_task accepts searchable_text and forwards it to the generator.

Tests

Updated for the new enqueue_embedding and internal _execute_sync / _enqueue_async signatures; mocks are reset where commit runs and triggers the new listeners.

- Replace session commit hooks with EmbeddableMixin after_insert/update - Remove embedding/events.py; enqueue via EmbeddingService with primitives - Add searchable_text to generator and Celery task to avoid reloading entity

EmanueleDeRossi1 self-assigned this Apr 14, 2026

EmanueleDeRossi1 force-pushed the feat/embedding-event-listeners branch from 84f7ac8 to cdf4816 Compare April 15, 2026 14:43

EmanueleDeRossi1 added this to the Release 20 (ETA Apr 23) milestone Apr 16, 2026

EmanueleDeRossi1 added 7 commits April 16, 2026 11:49

docs: correct comment for sessions's expire_on_commit

67e3a19

docs: correct comment

8075d9d

feat(embeddings): add SQLAlchemy session hooks for auto-embedding

ff37e29

style: fix formatting

fc88b46

feat(embeddings): add searchable_text_changed() to EmbeddableMixin

9cebf06

refactor(embedding): use mixin listeners

7f7e0c9

- Replace session commit hooks with EmbeddableMixin after_insert/update - Remove embedding/events.py; enqueue via EmbeddingService with primitives - Add searchable_text to generator and Celery task to avoid reloading entity

test(embedding): align tests with enqueue_embedding and listener commits

7a13c1e

EmanueleDeRossi1 force-pushed the feat/embedding-event-listeners branch from 33d68b9 to 7a13c1e Compare April 16, 2026 09:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto-generate embeddings on entity create/update#1639

Auto-generate embeddings on entity create/update#1639
EmanueleDeRossi1 wants to merge 7 commits intomainfrom
feat/embedding-event-listeners

EmanueleDeRossi1 commented Apr 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

EmanueleDeRossi1 commented Apr 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

What changed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

EmanueleDeRossi1 commented Apr 14, 2026 •

edited

Loading