GitHub - Vitomir84/SemanticSearch: Creation of Faiss vector database for enriching a prompts for Diffuser model

title	emoji	colorFrom	colorTo	sdk	sdk_version	app_file	pinned	short_description
Search Engine	🔥	green	red	streamlit	1.39.0	app.py	false	Semantic Search engine with Faiss

Check out the API of Search engine at https://huggingface.co/spaces/Vitomir/search_engine

For local deployment run

fast_api.py

Script creates swagger app with endpoints on localhost:8084. First endpoint return the top k semanticaly most similar prompts with query prompt. Second endpoint returns all similarites with query (only applicable for very small datasets).

Data Ingestion

data_reader.py

creates data of various prompts for encoding into vector database, from prompt-picture dataset. Local database encoded only 11000 prompts. Faiss index that is used is small and not optimized, used for experimental datasets. Search is brute force, not optimised.

Streamlit

streamlit run app.py

Should be run for streamlit app, it can be assessed locally on http://localhost:8501.

Docker

docker build -t my-streamlit-app .

from main dir

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
models		models
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
environment.yaml		environment.yaml
fast_api.py		fast_api.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

For local deployment run

Data Ingestion

Streamlit

Docker

About

Releases

Packages

Languages

Vitomir84/SemanticSearch

Folders and files

Latest commit

History

Repository files navigation

For local deployment run

Data Ingestion

Streamlit

Docker

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages