Data Catalog

Multimodal Data at Unprecedented Scale

Explore our comprehensive data catalog spanning audio, image, and video. All data is ethically sourced, quality-validated, and ready for foundation model training.

Audio & Speech Data

Hundreds of millions of hours covering all languages and scenarios.

Data TypeDescriptionScale
Chinese & English Raw AudioPodcasts, conversations, interviews, radio dramas, novels, film/TV, anime, etc.300M+ hours
Minority Language Raw AudioSpanish, Arabic, Portuguese, French, German, European languages, Japanese, Korean, Southeast Asian languages, etc.200M+ hours
Dialect Raw AudioCantonese, Sichuan, Northeastern, Shanghainese, Henan, etc.100M+ hours
Chinese & English TTS (Filtered)24k+ sample rate, 96–98% accuracy, podcasts/conversations/film & TV60M+ hours
Chinese & English 4o (Filtered)Includes speaker info and timestamps, high-quality subsets60M+ hours
Music & SongsLossless music with lyrics and timing info, 128/256kbps+200M+ tracks
Sound EffectsTagged or described audio, wav/mp3 formats300M+ items

Image & Vision Data

Tens of billions of images covering all categories and scenarios across the web.

Data TypeDescriptionScale
Web-Scale ImagesXiaohongshu, WeChat, Toutiao and other platforms, without descriptions20B+ images
Image-Text DatasetsOpen-source datasets (COYO, DataComp, OBELICS, etc.), image + text descriptions, interleaved20B+ pairs
Fine-Grained CategoriesAnimals, plants, cars, everyday objects, covering 10,000+ keywordsBillions
Premium Image Library2048×2048+ resolution with English labels and high-quality descriptions100M+ images
Real Portrait Data1080p+, single person, real photography100M+ images
Advertising Images1024+ resolution, with descriptions, ad copy, and creative info100M+ images
App Screenshot DataMobile/desktop software page screenshots across categories20M+ images

Video & Motion Data

Billions of video clips covering film, stock footage, digital humans, and all content types.

Data TypeDescriptionScale
Stock Video FootageVisual China, Shutterstock, short video platforms, 720p+Billions of clips
HD Video Footage1080p+, 6–60s, no subtitles/watermarks, nature/sports/scenery1B+ clips
Film & TV / Short DramaMovies, TV series, speeches, training, interviews, documentaries, Chinese & English40M+ hours
Digital Human (Raw)Streamers, B站/YouTube/TikTok, Chinese & English focused, 720p+100M+ hours
Digital Human (Clips)Single person on screen, no scene cuts, 3–30s segments, 720p+60M+ clips
Advertising Video1080p+, with descriptions, ad copy, and creative info50M+ clips
Open-Source DatasetsPexels, WebVid-10M, Koala-36M, Panda-70M, etc.Multiple major sets

Proprietary Data Platform

Data Collection Platform

  • Full coverage data crawling across mainstream platforms
  • Keyword, homepage, and multi-dimensional targeted collection
  • Real-time data quality and compliance monitoring

Data Annotation Platform

  • Audio, image, video multi-modal annotation tasks
  • Automated pre-annotation + human refinement in parallel
  • Industry-leading accuracy through rigorous QC processes

Data Delivery Platform

  • Standardized data formats with multiple delivery methods
  • Data encryption ensuring customer data privacy
  • Continuous iterative updates with custom data product support

50B+

Data Records

300M+

Hours of Audio

30+

Languages

30+

Top-Tier AI Enterprise Clients