1. Data & AI Solutions
  2. Off-the-Shelf Datasets
  3. English (U.S.) conversations in-studio speech dataset
  • audio

English (U.S.) conversations in-studio speech dataset

Features natural, unscripted conversations between adult native U.S. English speakers, each around 5 minutes long. Centered around various topics and role-playing scenarios, these dialogues capture spontaneous speech with all its complexities like pauses, overlaps, informal grammar and intonation.

Specifications

Modalities
Audio
Language
English (U.S.) [en-U.S.]
Total prompts
28
Total audio length
2:22h
Average recording length (in sec)
304.29
Participants
16
Group
Adults
Task category
Unscripted conversations
Data type
In-studio speech

Accelerate model development & training processes

  • Authentic, unscripted conversations

    Unscripted, free-form conversations between two speakers on given topics and role-play scenarios (e.g., between airline customer support and a customer about flight delay and compensation) that captures natural linguistic features such as false starts, filler words and spontaneous turn-taking.

  • Rich speaker and scenario diversity

    Covering a broad range of conversation types and topics, the dataset captures rich variation in language use, tone, pacing and speaker interaction, helping models learn to generalize across different conversational contexts, speaker personalities and interaction styles.

  • High-fidelity audio with acoustic control

    Recorded in rooms at least 8' x 10' x 7' in size, with reverberation time (RT60) under 0.4 seconds at all frequencies, for minimal echo and background noise. Audio is delivered in 48 kHz, 24-bit mono .wav format, with 1-second silence padding and utterances normalized to -26 dB RMS active speech.

Still searching for the right dataset? We can help.

Reach out and we’ll guide you to the right solution.

Case Studies

Explore our success stories

  • Evaluating a conversational AI model with a highly complex multimodal STEM dataset

    Man using his mobile device with a chatbot illustration above the device.

    Discover how our off-the-shelf science, technology, engineering and mathematics (STEM) dataset contributed to enhancing scientific reasoning and visual processing capabilities in a chatbot model crafted by a leading-edge tech and AI company.


    • 4485Physics prompt-response pairs


    • 9606Math prompt-response pairs

    Download case study
  • Improving large language model logic and reasoning with a specialized fine-tuning dataset

    Person working at a laptop holding a mobile phone with an overlaid illustration of LLM features.

    Explore how TELUS Digital created an off-the-shelf dataset to advance the capabilities of large language models (LLMs).


    • 50KSTEM-based prompt-response pairs created


    • 300Highly-skilled contributors

    Download case study

Access the English (U.S.) conversations in-studio speech dataset

Connect with our experts for pricing and samples.