
English (U.S.) conversations in-studio speech dataset

This dataset features natural, unscripted conversations between adult native U.S. English speakers, each around 5 minutes long. Built around a range of topics and role-playing scenarios, the dialogues capture spontaneous speech with its full complexity, including pauses, overlaps, informal grammar and varied intonation.

Specifications

Modalities: Audio
Language: English (U.S.) [en-US]
Total prompts: 28
Total audio length: 2 h 22 min
Average recording length: 304.29 sec
Participants: 16
Group: Adults
Task category: Unscripted conversations
Data type: In-studio speech
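
The figures above can be cross-checked with a little arithmetic. The sketch below assumes that "Total prompts" counts one recording per prompt, which the page does not state outright; the function name `to_hours_minutes` is made up for this illustration. Under that assumption, multiplying the number of recordings by the average recording length gives a total that matches the listed total audio length.

```python
def to_hours_minutes(total_seconds: float) -> tuple[int, int]:
    """Convert a duration in seconds to (hours, minutes), rounded to the nearest minute."""
    total_minutes = round(total_seconds / 60)
    return divmod(total_minutes, 60)


# Values taken from the specifications list.
recordings = 28            # assumption: one recording per prompt
average_seconds = 304.29   # average recording length in seconds

total_seconds = recordings * average_seconds
hours, minutes = to_hours_minutes(total_seconds)
print(f"{total_seconds:.2f} s is about {hours} h {minutes} min")
```

Dividing the same total by the 16 participants would not give per-speaker talk time, since each conversation involves more than one speaker.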

Accelerate model development & training processes

Still searching for the right dataset? We can help.

Reach out and we’ll guide you to the right solution.

Explore our success stories

  • Evaluating a conversational AI model with a highly complex multimodal STEM dataset

    Discover how our off-the-shelf science, technology, engineering and mathematics (STEM) dataset contributed to enhancing scientific reasoning and visual processing capabilities in a chatbot model crafted by a leading-edge tech and AI company.

    • 4,485 physics prompt-response pairs
  • Improving large language model logic and reasoning with a specialized fine-tuning dataset

    Explore how TELUS Digital created an off-the-shelf dataset to advance the capabilities of large language models (LLMs).

    • 50K STEM-based prompt-response pairs created

Access the English (U.S.) conversations in-studio speech dataset

Connect with our experts for pricing and samples.

Request the dataset