DATA COLLECTION SERVICES

Data solutions for emerging tech.

Globalme provides end-to-end data services to train and test the latest AI technologies.

What data can we provide for you?

OUR DATA SERVICES

Check out our individual data services or keep scrolling for a full overview.

Data Post-Processing

Multilingual Speech Transcription

Data Labeling & Classification

Image & Video Annotation

Testing

Speech Recognition Testing

Usability Testing

Requirements Testing

Out-of-Box Experience Testing

DATA DRIVES INNOVATION

Data is the most critical element in the development of machine-learning technology. Your data needs to be:

Natural

Real-world products require real-world data. To properly train your AI, you’ll need data from the environments in which your product or solution will actually be used.

Customized

Whether it’s audio of a certain frequency, images under certain lighting, or videos at a particular angle, most machine learning projects require highly specific or varied input data.

Scalable

Many machine learning projects require huge quantities of data from all around the world, collected on a tight timeline. Remote data collection makes that lofty goal a reality.

WHAT TYPES OF DATA DO WE COLLECT?

Globalme collects a wide variety of data for AI-powered devices, including fitness wearables, voice assistants, and autonomous vehicles.

Speech Data

Custom speech data in over 35 languages, adaptable to any acoustic or scenario setup, whether inside a car, in a recording studio, or at a dinner party.

Image Data

Train your computer vision product with unique scenario setups or remotely collected images of faces, traffic, handwriting, documents, and more.

Video Data

Enhance object and facial recognition technologies with videos of human interactions, traffic patterns, and more—in naturally occurring or highly controlled environments.

HOW DO WE COLLECT DATA?

We have lots of ways of collecting lots of data.

In-Person Data Collection

Projects with complex requirements, like a specific microphone or camera, are best suited for in-person data collection.

We travel across the world to collect specialized data in different languages and countries. We’ve recorded data in cars, in warehouses, during athletes’ training sessions, and even at dinner parties.

If you need a specialized scenario with specific requirements, we can make it happen.

Remote Data Collection

Need lots of data, and fast? Your project is likely best suited for remote data collection.

We’ve built the technology to quickly gather a wide variety of data from a worldwide database of diverse users through our proprietary mobile app.

Whether you need thousands of speech samples in a particular accent, pictures of receipts in a specific country, or videos of everyday life, Globalme can provide high-quality, thoroughly vetted data to suit the needs of your project.

Telephone Data Collection

In need of conversational data? We’ve developed a phone call recording system that makes it easy to capture telephone conversations on one or more phone lines, with participants in any language or location.

Check out our phone conversation data sample for a preview of our data collection capabilities.

DATA POST-PROCESSING SERVICES

It doesn’t end at collection. We provide full data processing services to hand-deliver perfectly annotated data.

Multilingual Speech Transcription

Our native transcribers provide accurate phonetic transcriptions according to your unique requirements—including custom noise-markers and segmentation rules.

Data Labeling & Classification

Once transcribed, the speech and video data is tagged and bucketed into various domains. Everything is classified based on the product’s feature set and scope.

Image & Video Annotation

After image or video collection, we can annotate the objects within each given image or frame—based on your requirements and needed file formats.

TESTING FOR EMERGING TECHNOLOGIES

Once you’ve built your AI-powered product, we’ll help you test your device in the hands of real users.

Speech Recognition Testing

Test the accuracy of your speech recognition products with validation data from 35+ languages.

Usability Testing

We’ll test your product in a natural setting to bring to light potential issues before your product hits the shelves.

Out-of-Box Experience Testing

You only have one chance at a first impression. We test the user’s first interaction with your product in real time.

Requirements Testing

We use validation data sets, automated and manual testing, and more to evaluate your product against pass/fail criteria.

We get the data. You build the future.

Tell us about your data collection needs and we’ll provide a full end-to-end solution.

Get started now.

WHAT MAKES GLOBALME DIFFERENT?

Here’s why many of the world’s most successful companies turn to Globalme for their data collection needs.

end-to-end

We provide full end-to-end data collection services—including project management, collection, post-processing, annotation, and delivery.

customized

We’ve developed custom tools and processes that give us the flexibility to collect data to meet your exact requirements.

35+ languages

Whether it’s speech collected in-field or online, we’ve built the infrastructure to access a global network of diverse participants.

quality

Machine learning feeds on high-quality data. That’s why our data is heavily reviewed for quality and collected to your exact specifications from the start.

efficient

Our proprietary data post-processing and delivery platform allows us to share field-collected or remotely collected data efficiently, in real time.

experienced

After over 10 years in business, Globalme is a trusted partner to many of the world’s most prominent emerging technology companies.

CLIENT CASE STUDIES

See how we’ve helped bring cutting-edge technology to the world stage.

Dialect and Accent Speech Data Collection

Sonos wanted to integrate wireless speakers and smart home assistants. By collecting speech and accent data across three countries, Sonos was able to fine-tune their voice recognition engines to provide their users with a better voice experience.

19 Accents. 799 Participants. 3 Countries.

Read the case study

Multilingual Voice Data Collection

Nuance was developing the next generation of in-car speech recognition technology. We helped them collect and process hundreds of hours of voice data in various languages, demographics, and locations around the world.

15 Languages. 2000 Participants. 600 Hours of Data.

Read the case study

WHAT OUR CLIENTS ARE SAYING

Martin

Manager, Data Collection, Nuance Communications

“Globalme has provided exceptional services to the Data Collection team at Nuance Communications, Inc. They have supervised large scale data collection simultaneously in three different countries, consistently delivering quality data on or ahead of schedule. And this was done twice in short order – in Europe and in Asia.

Especially notable is their dedication to constantly open lines of communication. We always receive prompt responses regardless of often significant differences in time zone. The team members are intelligent, professional, and passionate about the work they do. Their diligence and creativity in terms of problem-solving ensured the success of the project. Our continuing relationship with Globalme is a great asset to the company.”

Get the data you need.

Join our list of successful clients. Contact us now to get started.

What can we do for you?

DATA COLLECTION GUIDES

Download these free guides for expert advice on data collection.

Building an Advanced Smart Home AI

Can we make the integration of advanced AI in smart homes a reality? Find out as we explain how advanced AI can be made possible with the help of custom data collection.

Download

The Ultimate Guide to Data Collection

Developing state-of-the-art technologies could mean searching for the right data. Download this guide to learn about the process of data collection for emerging technologies.

Download

Building Chatbots with Zero Experience

Can a chatbot be configured with no machine learning experience? This white paper shows how a Globalme team member configured Lex, Dialogflow, Watson, and Rasa with minimal chatbot experience.

Download

FREE SAMPLE DATASET DOWNLOADS

Check out a sample of the data we provide with these free downloads.

Eye Gaze Sample Dataset

High-quality eye gaze data helps your biometric-enabled device understand human eye movement. Get a sample of our eye gaze data.

Download

Alexa Wake Word Sample Dataset

Listen to samples of multilingual Alexa wake word voice commands to hear why custom data collection can help you create a better user experience for an international audience.

Download

Road, Car, and People Sample Dataset

Whether you’re building security surveillance tech or autonomous cars, visual data of roads, cars, and people gives devices a set of eyes. Get your sample dataset here.

Download

WHO IS GLOBALME?

Globalme is a language technologies company founded in 2007. We joined the Summa Linguae Technologies family of companies in late 2019.

We provide localization, data collection, and managed services—specializing in working with the latest emerging technologies.

We are headquartered in Vancouver, BC, Canada, and have been listed as one of BC’s fastest-growing companies for 5 straight years. Our incredibly diverse staff represents 33 nationalities and speaks over 30 languages.

Meet Our Team

Ready to get started?

We’re excited to get started on your project. Let us know what you’re looking for and we’ll provide a free estimate.

Get a free estimate.