Velvet logo

Velvet

Adobe
Final Cut
AWS

AI that passes the audiovisual Turing test

About Velvet

Velvet is a data research lab focused exclusively on multimodal models. Its purpose is to help artificial intelligence succeed in audiovisual interpretation. Velvet provides tools and resources for research in audiovisual communication, enabling exploration of new worlds of interaction. The platform primarily targets researchers, engineers, and AI professionals aiming to better understand and improve multimodal communication. By offering a library of datasets and evaluations for audiovisual communication, Velvet seeks to advance research in multimodality.

Source: Velvet official website

Key Features

Multimodal models

development of models that understand multiple modalities of communication

Open exploration

ability to explore new horizons in audiovisual research

Conversational interactions

tools to analyze and enhance multimodal interactions

Data processing pipeline

design and implementation of pipelines for audiovisual data processing

Research collaborations

opportunities to collaborate with AI experts

Performance evaluations

tools to assess the effectiveness of multimodal models

Data quality management

processes to ensure high-quality data

Research community

access to a vibrant community dedicated to multimodal research.

Practical Use Cases

Development of AI models for multimodal translation
Analysis of customer interactions through video data
Enhancement of recommendation systems based on audiovisual content
Starting from
Free
usage-based
Visit
API
Mobile App