About Open-Vision-Language
This website uses an easy-to-remember subdomain name to conveniently host scientific projects and results on vision and language.
The current member projects of this domain include:
- SuTI: An in-context subject-driven text-to-image generator that draws subject-specific images without fine-tuning.
- OVEN: Datasets and empirical results for a novel task of "Open-domain Visual Entity Recognition".
- InfoSeek: A new VQA dataset that evaluates multimodal LLMs on answering visual information-seeking questions.