Skip to content

Creation of Faiss vector database for enriching a prompts for Diffuser model

Notifications You must be signed in to change notification settings

Vitomir84/SemanticSearch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

title emoji colorFrom colorTo sdk sdk_version app_file pinned short_description
Search Engine
🔥
green
red
streamlit
1.39.0
app.py
false
Semantic Search engine with Faiss

Check out the API of Search engine at https://huggingface.co/spaces/Vitomir/search_engine

For local deployment run

fast_api.py

Script creates swagger app with endpoints on localhost:8084. First endpoint return the top k semanticaly most similar prompts with query prompt. Second endpoint returns all similarites with query (only applicable for very small datasets).

Data Ingestion

data_reader.py

creates data of various prompts for encoding into vector database, from prompt-picture dataset. Local database encoded only 11000 prompts. Faiss index that is used is small and not optimized, used for experimental datasets. Search is brute force, not optimised.

Streamlit

streamlit run app.py

Should be run for streamlit app, it can be assessed locally on http://localhost:8501.

Docker

docker build -t my-streamlit-app .

from main dir

About

Creation of Faiss vector database for enriching a prompts for Diffuser model

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published