Add vespa as Vectordb #1676

gnanesh-16 · 2025-01-02T06:10:21Z

VespaDb Vector Database Implementation

Description

Summary of changes: A new VectorDb implementation using Vespa to support vector and hybrid search capabilities. This class integrates embedding and reranking functionalities, supports document upsert, and uses Vespa's querying system for search operations.
Related issues: This implementation fixes issue Vespa as VectorDB #1504.
Motivation and context: This feature allows efficient vector-based, keyword-based, and hybrid search operations, addressing the need for scalable and flexible vector databases in the application.
Environment or dependencies: Requires the Vespa Python client and its dependencies to be installed (the `pyvespa` package, imported as `vespa`). Ensure the Vespa instance is running locally or reachable at the specified URI.
Impact on AI/ML components: Enhances search capabilities by leveraging vector embeddings and hybrid query strategies. Performance improvements depend on embedding model accuracy and Vespa query efficiency.

Setup Instructions

Ensure Vespa is installed and running.
Install required Python dependencies:
```
pip install pyvespa phidata
```
Verify the Vespa application is accessible at http://localhost:8080 or configure the correct URI.
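
The accessibility check in the last step can be scripted. The following is a minimal sketch, assuming the Vespa container exposes its standard health endpoint at `/state/v1/health` and reports a status code of `"up"` in the JSON body; the helper names `health_url` and `is_vespa_up` are hypothetical and not part of this PR.

```python
import http.client
import json
import urllib.request


def health_url(base_url: str) -> str:
    """Build the health-check URL for a Vespa container endpoint."""
    return base_url.rstrip("/") + "/state/v1/health"


def is_vespa_up(base_url: str = "http://localhost:8080", timeout: float = 2.0) -> bool:
    """Return True if the endpoint answers and reports status code "up"."""
    try:
        with urllib.request.urlopen(health_url(base_url), timeout=timeout) as resp:
            payload = json.load(resp)
    except (OSError, ValueError, http.client.HTTPException):
        # OSError covers URLError, refused connections, and timeouts;
        # ValueError covers a response body that is not valid JSON.
        return False
    status = payload.get("status") if isinstance(payload, dict) else None
    # Assumption: a healthy container answers {"status": {"code": "up"}, ...}.
    return isinstance(status, dict) and status.get("code") == "up"
```

Calling `is_vespa_up()` before `vespa_db.create()` gives a clear failure early instead of a connection error deep inside the client.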

Usage

Creating a VespaDb Instance

```
from vespa_db import VespaDb

vespa_db = VespaDb(
    uri="http://localhost:8080",
    app_name="my_vespa_app"
)
vespa_db.create()
```

Inserting Documents

```
from phi.document import Document

documents = [
    Document(name="Doc1", content="Sample content for document 1"),
    Document(name="Doc2", content="Sample content for document 2"),
]
vespa_db.insert(documents)
```

Performing Searches

```
results = vespa_db.search(query="Sample query", limit=10)
for result in results:
    print(result.name, result.content)
```

Hybrid Search

```
results = vespa_db.hybrid_search(query="Sample hybrid query", limit=5)
```
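
For context, a hybrid query in Vespa typically combines a keyword match (`userQuery()`) with a `nearestNeighbor` operator in one YQL statement. The sketch below only builds the request body; it is not the PR's implementation. The field name `embedding` is taken from the cookbook schema quoted later in this thread, while the rank profile name `hybrid`, the query tensor name `q`, and the helper `build_hybrid_query` are assumptions for illustration.

```python
from typing import Any, Dict, List


def build_hybrid_query(
    text: str,
    embedding: List[float],
    limit: int = 5,
    field: str = "embedding",      # assumed tensor field name
    rank_profile: str = "hybrid",  # assumed rank profile defined in the schema
) -> Dict[str, Any]:
    """Build a Vespa query body that matches on keywords OR nearest neighbors."""
    yql = (
        "select * from sources * where userQuery() or "
        f"({{targetHits:{limit}}}nearestNeighbor({field}, q))"
    )
    return {
        "yql": yql,
        "query": text,                # consumed by userQuery()
        "ranking": rank_profile,
        "input.query(q)": embedding,  # query tensor referenced as `q` in the YQL
        "hits": limit,
    }
```

A body like this would be sent as JSON to the application's `/search/` endpoint, and the rank profile has to declare the `query(q)` input with a matching tensor type for the request to be accepted.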

Dropping the Database

```
vespa_db.drop()
```

Development Notes

Test cases should validate:
- Vector search accuracy
- Hybrid search behavior
- Document upsertion and retrieval
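
The checks listed above could take roughly the following shape. This is a sketch only: `InMemoryStore` is a hypothetical stand-in that ranks by cosine similarity, used here so the assertions run without a Vespa instance. Real tests would exercise `VespaDb` against a running application.

```python
import math
from typing import Dict, List, Tuple


class InMemoryStore:
    """Hypothetical stand-in for a vector store; not the PR's VespaDb."""

    def __init__(self) -> None:
        self._docs: Dict[str, Tuple[str, List[float]]] = {}

    def upsert(self, name: str, content: str, embedding: List[float]) -> None:
        # Same name overwrites the previous entry, which is what "upsert" implies.
        self._docs[name] = (content, embedding)

    def get(self, name: str) -> str:
        return self._docs[name][0]

    def search(self, query_embedding: List[float], limit: int = 5) -> List[str]:
        def cosine(a: List[float], b: List[float]) -> float:
            dot = sum(x * y for x, y in zip(a, b))
            norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
            return dot / norm if norm else 0.0

        ranked = sorted(
            self._docs,
            key=lambda n: cosine(query_embedding, self._docs[n][1]),
            reverse=True,
        )
        return ranked[:limit]


def test_upsert_overwrites_existing_document() -> None:
    store = InMemoryStore()
    store.upsert("Doc1", "old", [1.0, 0.0])
    store.upsert("Doc1", "new", [1.0, 0.0])
    assert store.get("Doc1") == "new"


def test_nearest_document_ranks_first() -> None:
    store = InMemoryStore()
    store.upsert("Doc1", "a", [1.0, 0.0])
    store.upsert("Doc2", "b", [0.0, 1.0])
    assert store.search([0.9, 0.1], limit=1) == ["Doc1"]
```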

manthanguptaa · 2025-01-02T10:56:03Z

Can you also add a cookbook for this?

gnanesh-16 · 2025-01-02T11:42:48Z

> Can you also add a cookbook for this?

I haven’t added the cookbook for this yet. I will include it in a subsequent PR. Thank you for bringing this to my attention, @manthanguptaa.

manthanguptaa · 2025-01-02T11:53:29Z

cookbook/vectordb/vespa_db.py

Please make it consistent with the other cookbooks. Here is an example
https://github.com/phidatahq/phidata/blob/main/cookbook/vectordb/chroma_db.py

OK @manthanguptaa, I will raise another pull request for the cookbook that addresses these points.

manthanguptaa · 2025-01-06T11:43:31Z

phi/vectordb/vespa/vespa_db.py

Add steps on top of the file on how to run vespa

@manthanguptaa, I will add all the steps for running Vespa at the top of the file and make sure the instructions are clear and easy to follow.

manthanguptaa · 2025-01-06T11:45:08Z

phi/vectordb/vespa/vespa_db.py

+try:
+    import vespa  # type: ignore
+except ImportError:
+    raise ImportError("`vespa` not installed.")


raise ImportError("`vespa` not installed. Please install using `pip install vespa`")

or whatever the correct way is
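
One way to follow this suggestion is an import guard whose message names the install command. The sketch below wraps the pattern in a hypothetical helper, `require`, so it can be run without Vespa installed. The PyPI name `pyvespa` is an assumption that should be confirmed against the Vespa client's documentation before it goes into the error message.

```python
import importlib
from types import ModuleType


def require(module_name: str, pip_name: str) -> ModuleType:
    """Import a module, or raise ImportError with an actionable install hint."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"`{module_name}` not installed. Please install using `pip install {pip_name}`"
        ) from exc


# Usage at the top of a module such as vespa_db.py (package name is an assumption):
# vespa = require("vespa", "pyvespa")
```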

manthanguptaa · 2025-01-06T11:45:39Z

cookbook/vectordb/vespa_db.py

@@ -0,0 +1,30 @@
+# install vespa - `pip install phi-vespa`


what's phi-vespa?

manthanguptaa · 2025-01-06T12:02:55Z

cookbook/vectordb/vespa_db.py

+vector_db = VespaDb(
+    app_name="recipes",
+    url="http://localhost:8080",
+    schema={
+        "fields": {
+            "text": {"type": "string"},
+            "embedding": {"type": "tensor(x[384])", "attribute": True},
+            "metadata": {"type": "string", "attribute": True}
+        }
+    }
+)


Your `VespaDb` class takes `uri` as a parameter, not `url`. Please make sure to thoroughly test your code before raising a PR.

Also, there is no `schema` field in your `VespaDb` class.
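
A mismatch like this can be caught before review by comparing the keyword arguments an example passes against the constructor's signature. The following is a minimal sketch using `inspect.signature`. `VespaDbStandIn` is a hypothetical class that carries only the two parameters shown in the PR description (`uri`, `app_name`), and `unexpected_kwargs` is an illustrative helper, not part of phidata.

```python
import inspect
from typing import Any, Callable, Dict, List


def unexpected_kwargs(target: Callable[..., Any], kwargs: Dict[str, Any]) -> List[str]:
    """Return the keyword names that `target` would reject, sorted."""
    params = inspect.signature(target).parameters
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return []  # target accepts **kwargs, so nothing is rejected by name
    return sorted(k for k in kwargs if k not in params)


class VespaDbStandIn:
    """Hypothetical stand-in with the parameters shown in the PR description."""

    def __init__(self, uri: str, app_name: str) -> None:
        self.uri = uri
        self.app_name = app_name


cookbook_kwargs = {"app_name": "recipes", "url": "http://localhost:8080", "schema": {}}
print(unexpected_kwargs(VespaDbStandIn, cookbook_kwargs))
```

Simply running the cookbook script once would surface the same problem as a `TypeError`, which is the quickest check of all.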

manthanguptaa

Your code isn't working at all. Please test before raising a PR. It will save a lot of to and fro on both ends. It is okay to use AI to code but you will have to test it on your end as well.

gnanesh-16 · 2025-01-06T14:25:52Z

> Your code isn't working at all. Please test before raising a PR. It will save a lot of to and fro on both ends. It is okay to use AI to code but you will have to test it on your end as well.

Thank you for your feedback, @manthanguptaa. I apologize for the oversight. I will correct these issues and test the code on my end before raising a PR in the future.

manthanguptaa · 2025-01-22T17:04:39Z

Closing due to inactivity

Add vespa as Vectordb

8392546

Add cookbok for vespa_db

d84c363

manthanguptaa reviewed Jan 2, 2025

gnanesh-16 and others added 2 commits January 2, 2025 17:38

Add Vespa_db in Cookbook

32c8c73

Merge branch 'main' into vesp-db-b1

37cee2e

manthanguptaa reviewed Jan 6, 2025

manthanguptaa reviewed Jan 6, 2025

manthanguptaa requested changes Jan 6, 2025

Merge branch 'main' into vesp-db-b1

8986d6f

manthanguptaa closed this Jan 22, 2025
