Developing voice-enabled technology?

High-quality speech data on an easy-access data management platform

Enjoy 100% flexibility with acoustic and scenario setup. From inside a car to a dinner party, you name it and we’ll set it up.

Collect data in 35+ languages: we gather voice data locally and abroad, at small scale or with hundreds of participants.

What type of speech data do you need?

Tell us about your project and we’ll tailor a data collection plan to your exact needs.

CUSTOM SPEECH DATA COLLECTION SOLUTIONS

Globalme offers end-to-end speech data collection solutions to ensure your voice-enabled technology is ready for a diverse and multilingual audience.

We can take on any scope of project, from building a natural language corpus to managing in-field data collection, transcription, and semantic analysis.

Using Globalme’s custom-built multilingual data management platform, our clients are able to access their data and the associated metadata quickly and efficiently through an easy-to-use API.

OUR RECENT SPEECH DATA COLLECTION PROJECTS

Globalme has been involved in the development of various exciting and innovative data collection projects for speech recognition devices.

Speech-Enabled Speaker System

4 countries
800 participants
600+ hours of audio

Read the case study

In-Car Speech System

10 countries
3,000 participants
1000+ hours of audio

Read the case study

Smart Fitness Wearable

5 languages
500 participants
125+ hours of audio
100,000 utterances annotated

Speech-Enabled Voice Assistant

9 countries
5,400 participants
2000+ hours of audio

SPEECH DATA TO FIT YOUR NEEDS

We know that data collection projects for speech recognition come in all shapes and sizes.

Some projects have extremely specific acoustic or participant requirements that demand tremendous amounts of planning, creativity, and innovation. We love those projects.

Other projects have simpler specifications but may require a fast turnaround or an extremely high volume of speech samples. We love those projects too.

No matter what the scope and scale of your speech data collection project, Globalme can custom-tailor a collection solution to suit your needs.

SPEECH DATA SAMPLE DOWNLOADS

Download our free speech data sample sets to see whether our data solutions are a fit for your project.

Alexa Wake Word Samples

24 custom audio samples
4 languages
Varying ages and genders

Download the data set

Phone Conversation Samples

Natural phone conversations
3 languages
Transcriptions included

Download the data set

WHAT OUR CLIENTS ARE SAYING

Martin

Manager Data Collection at Nuance Communications

Globalme has provided exceptional services to the Data Collection team at Nuance Communications, Inc. They have supervised large scale data collection simultaneously in three different countries, consistently delivering quality data on or ahead of schedule. And this was done twice in short order – in Europe and in Asia. Especially notable is their dedication to constantly open lines of communication. We always receive prompt responses regardless of often significant differences in time zone. The team members are intelligent, professional, and passionate about the work they do. Their diligence and creativity in terms of problem-solving ensured the success of the project. Our continuing relationship with Globalme is a great asset to the company.

Data Post-Processing

Multilingual Speech Transcription

Data Labeling & Classification

Image & Video Annotation

Testing

Speech Recognition Testing

Usability Testing

Requirements Testing

Out-of-Box Experience Testing