A web application for creating your own image dataset
Use DatGen to:
- Easily create your own customized image dataset through an intuitive web application.
- Decide which and how objects appear in your dataset using natural language input.
- Make use of human annotated public image datasets.
- Leverage multimodal AI systems capable of detecting and generating images matching your needs.
Head over here and start building your dataset. You will be able to choose, using text input and dropdown menus:
- The objects that compose your dataset.
- Their size with respect to the rest of the image.
- Their visual attributes.
- Their location.
- The number of images you want for each object.
- Whether to match the images in their contrast values.
DatGen will take care of scraping public image databases that match your input specifications, or leverage a multimodal deep generative model to create the images that you need.
You dataset will be generated in a matter of minutes and ready to download in compressed format!
If you are experiencing an issue when using DatGen, or you simply want to leave some feedback, please open an issue.
- Clone this repository on your local machine
git clone https://github.com/martinagvilas/DatGen.git
- Create a conda environment (optional)
conda create --name datgen python=3.9
conda activate datgen
- Install dependencies
pip install -r requirements.txt
- Install DatGen source code
pip install -e .
DatGen is built on top of:
- Streamlit, an open source tool for building web applications.
- CLIP, a text-language deep learning model.
- Dalle-mini, a multimodal deep generative model.