Data Catalog
Explore our comprehensive data catalog spanning audio, image, and video. All data is ethically sourced, quality-validated, and ready for foundation model training.
Hundreds of millions of hours covering all languages and scenarios.
| Data Type | Description | Scale |
|---|---|---|
| Chinese & English Raw Audio | Podcasts, conversations, interviews, radio dramas, novels, film/TV, anime, etc. | 300M+ hours |
| Minority Language Raw Audio | Spanish, Arabic, Portuguese, French, German, European languages, Japanese, Korean, Southeast Asian languages, etc. | 200M+ hours |
| Dialect Raw Audio | Cantonese, Sichuan, Northeastern, Shanghainese, Henan, etc. | 100M+ hours |
| Chinese & English TTS (Filtered) | 24k+ sample rate, 96–98% accuracy, podcasts/conversations/film & TV | 60M+ hours |
| Chinese & English 4o (Filtered) | Includes speaker info and timestamps, high-quality subsets | 60M+ hours |
| Music & Songs | Lossless music with lyrics and timing info, 128/256kbps+ | 200M+ tracks |
| Sound Effects | Tagged or described audio, wav/mp3 formats | 300M+ items |
Tens of billions of images covering all categories and scenarios across the web.
| Data Type | Description | Scale |
|---|---|---|
| Web-Scale Images | Xiaohongshu, WeChat, Toutiao and other platforms, without descriptions | 20B+ images |
| Image-Text Datasets | Open-source datasets (COYO, DataComp, OBELICS, etc.), image + text descriptions, interleaved | 20B+ pairs |
| Fine-Grained Categories | Animals, plants, cars, everyday objects, covering 10,000+ keywords | Billions |
| Premium Image Library | 2048×2048+ resolution with English labels and high-quality descriptions | 100M+ images |
| Real Portrait Data | 1080p+, single person, real photography | 100M+ images |
| Advertising Images | 1024+ resolution, with descriptions, ad copy, and creative info | 100M+ images |
| App Screenshot Data | Mobile/desktop software page screenshots across categories | 20M+ images |
Billions of video clips covering film, stock footage, digital humans, and all content types.
| Data Type | Description | Scale |
|---|---|---|
| Stock Video Footage | Visual China, Shutterstock, short video platforms, 720p+ | Billions of clips |
| HD Video Footage | 1080p+, 6–60s, no subtitles/watermarks, nature/sports/scenery | 1B+ clips |
| Film & TV / Short Drama | Movies, TV series, speeches, training, interviews, documentaries, Chinese & English | 40M+ hours |
| Digital Human (Raw) | Streamers, B站/YouTube/TikTok, Chinese & English focused, 720p+ | 100M+ hours |
| Digital Human (Clips) | Single person on screen, no scene cuts, 3–30s segments, 720p+ | 60M+ clips |
| Advertising Video | 1080p+, with descriptions, ad copy, and creative info | 50M+ clips |
| Open-Source Datasets | Pexels, WebVid-10M, Koala-36M, Panda-70M, etc. | Multiple major sets |
50B+
Data Records
300M+
Hours of Audio
30+
Languages
30+
Top-Tier AI Enterprise Clients