Add embeddings storage and search by dylanmcreynolds · Pull Request #44 · als-computing/splash_links

dylanmcreynolds · 2026-04-06T01:30:47Z

Many workflows need to produce, store, and search embeddings.

This PR attempts to support that workflow by taking advantage of vector storage and search in Postgres via pgvector. This is a little experimental: frameworks like FAISS provide much more sophisticated math for indexing and searching embeddings, but doing it in pgvector first gives us something to work with and evaluate.
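To make the pgvector idea concrete, here is a minimal sketch of what a nearest-neighbor query computes. pgvector's `<=>` operator returns cosine distance, and the pure-Python functions below are a brute-force reference for the same ranking. The table name `embeddings`, the columns `entity_id` and `embedding`, and the helper names are assumptions for illustration, not the schema in this PR.

```python
import math

# Hypothetical query shape; table and column names are assumptions.
# `<=>` is pgvector's cosine distance operator.
SEARCH_SQL = """
SELECT entity_id
FROM embeddings
ORDER BY embedding <=> %s::vector
LIMIT %s
"""


def cosine_distance(a, b):
    """Return 1 - cosine similarity, the quantity `<=>` orders by."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def nearest(query, rows, limit=2):
    """Brute-force stand-in for ORDER BY embedding <=> query LIMIT n."""
    ranked = sorted(rows, key=lambda row: cosine_distance(row[1], query))
    return [entity_id for entity_id, _ in ranked[:limit]]


# Toy data: (entity_id, embedding) pairs.
rows = [("a", [1.0, 0.0]), ("b", [0.0, 1.0]), ("c", [0.9, 0.1])]
result = nearest([1.0, 0.0], rows)  # ["a", "c"]
```

A brute-force scan like this is exact but linear in the number of rows; the approximate indexes in pgvector or FAISS trade some recall for speed, which is part of what this PR should let us evaluate.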

This PR adds a REST endpoint for storing and retrieving embeddings (because GraphQL requires JSON for data). To be honest, for ease, the embeddings are currently sent as JSON there too, but in the future we can come up with a more efficient serialization strategy (probably involving NumPy).
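As a rough illustration of why a binary format could help, the sketch below compares a JSON payload with packed little-endian float32 values using only the standard library. The function names are hypothetical and this is not the serialization used in this PR. With NumPy the equivalent would be along the lines of `np.asarray(vec, dtype="<f4").tobytes()` and `np.frombuffer(payload, dtype="<f4")`.

```python
import json
import struct


def encode_json(vec):
    """Serialize a vector the way a plain JSON body would."""
    return json.dumps(vec).encode("utf-8")


def encode_float32(vec):
    """Pack a vector as little-endian float32, 4 bytes per value."""
    return struct.pack(f"<{len(vec)}f", *vec)


def decode_float32(payload):
    """Inverse of encode_float32."""
    count = len(payload) // 4
    return list(struct.unpack(f"<{count}f", payload))


# 384 values chosen to be exactly representable in float32,
# so the round trip below is lossless for this toy vector.
vec = [0.125 * i for i in range(384)]

json_size = len(encode_json(vec))
binary_size = len(encode_float32(vec))  # 384 * 4 bytes
round_trip = decode_float32(encode_float32(vec))
```

Note that float32 packing loses precision if the source values are float64, so whether that is acceptable depends on the model producing the embeddings.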

This also adds embedding-based search for entities, exposed through GraphQL.

There is also an embedding_models table where we can record which model produced each embedding and link it to a URL in a service like MLflow.
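One possible shape for that relationship is sketched below, using an in-memory SQLite database as a stand-in for Postgres so the example runs anywhere. Only the table name `embedding_models` comes from this PR; every column name, the `embeddings` table layout, and the registry URL are assumptions for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Hypothetical columns; only the table name comes from the PR.
    CREATE TABLE embedding_models (
        id INTEGER PRIMARY KEY,
        name TEXT NOT NULL,
        version TEXT NOT NULL,
        dimensions INTEGER NOT NULL,
        registry_url TEXT,
        UNIQUE (name, version)
    );
    -- Each embedding points at the model that produced it.
    CREATE TABLE embeddings (
        id INTEGER PRIMARY KEY,
        entity_id TEXT NOT NULL,
        model_id INTEGER NOT NULL REFERENCES embedding_models(id),
        vector_json TEXT NOT NULL
    );
    """
)

# Placeholder model record; the URL is not a real registry entry.
conn.execute(
    "INSERT INTO embedding_models (id, name, version, dimensions, registry_url) "
    "VALUES (?, ?, ?, ?, ?)",
    (1, "example-encoder", "1", 3, "https://mlflow.example.org/models/example-encoder/1"),
)
conn.execute(
    "INSERT INTO embeddings (entity_id, model_id, vector_json) VALUES (?, ?, ?)",
    ("entity-1", 1, "[0.1, 0.2, 0.3]"),
)

# Resolve which model, and which registry URL, an embedding came from.
row = conn.execute(
    "SELECT m.name, m.registry_url "
    "FROM embeddings e JOIN embedding_models m ON m.id = e.model_id "
    "WHERE e.entity_id = ?",
    ("entity-1",),
).fetchone()
```

Tracking the model matters for search as well as provenance: distances between vectors from different models are not comparable, so a search would likely need to filter on the model before ranking.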

dylanmcreynolds added 2 commits April 5, 2026 18:18

Add vector storage and search

aeaa0e3

add embedding_model table

e6029b7

dylanmcreynolds wants to merge 2 commits into main from embeddings
