Clione

GoLD: Grounded Language Dataset

Unverified
  • November 26, 2025, 01:09 AM
Last updated
Unknown
Release date
September 28, 2020
Size
825 samples | -- GB
License
Unknown
Tags
hri
language grounding
multi-modal

The Grounded Language Dataset (GoLD) is a multimodal dataset of common household objects described by people using either spoken or written language. GoLD comprises RGB and depth point cloud images of 47 classes of objects in five high-level categories. It includes 8250 text and 4059 speech descriptions gathered through Amazon Mechanical Turk (AMT).
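Since this card lists TSV among the dataset's formats, the text descriptions can presumably be read with a standard tab-separated parser. The following is a minimal sketch of grouping descriptions by the item they describe. The column names `item_id` and `text`, and the sample rows, are hypothetical placeholders chosen for illustration; they are not taken from the GoLD files, so check the actual headers before adapting this.

```python
import csv
import io
from collections import defaultdict


def group_descriptions(tsv_text, id_column="item_id", text_column="text"):
    """Group description strings by item identifier from tab-separated text.

    The default column names are assumptions, not the real GoLD schema.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    grouped = defaultdict(list)
    for row in reader:
        grouped[row[id_column]].append(row[text_column].strip())
    return dict(grouped)


# Made-up rows for illustration only; these are not GoLD data.
SAMPLE_TSV = (
    "item_id\ttext\n"
    "item_a\tFirst description of item A.\n"
    "item_a\tSecond description of item A.\n"
    "item_b\tOnly description of item B.\n"
)

if __name__ == "__main__":
    for item, texts in group_descriptions(SAMPLE_TSV).items():
        print(item, len(texts))
```

To use it on a real file, read the file contents into a string and pass the actual column names through `id_column` and `text_column`.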


Modality
image
Format
PNG
WAV
TSV
Annotation
Label type: natural language, audio
Annotators: Crowdsourced from Amazon Mechanical Turk (AMT)
Description: Images are labeled with their description in multiple formats: text, speech (audio), and automatically recognized speech derived from the audio files.
Source
Author
Patrick Jenkins
Rishabh Sachdeva
Gaoussou Youssouf Kebe
Padraig Higgins
Kasra Darvish
Edward Raff
Don Engel
John Winder
Francis Ferraro
Cynthia Matuszek
Institution
University of Maryland, Baltimore County
Booz Allen Hamilton
Johns Hopkins Applied Physics Laboratory
Contact
pjenk1@umbc.edu
rishabs1@umbc.edu
mb88814@umbc.edu
phiggin1@umbc.edu
kasradarvish@umbc.edu
edraff1@umbc.edu
donengel@umbc.edu
jwinder1@umbc.edu
ferraro@umbc.edu
cmat@umbc.edu

Citation

@inproceedings{kebe2021a,
  title = {A Spoken Language Dataset of Descriptions for Speech-Based Grounded Language Learning},
  author = {Gaoussou Youssouf Kebe and Padraig Higgins and Patrick Jenkins and Kasra Darvish and Rishabh Sachdeva and Ryan Barron and John Winder and Donald Engel and Edward Raff and Francis Ferraro and Cynthia Matuszek},
  booktitle = {Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1)},
  year = {2021},
  url = {https://openreview.net/forum?id=Yx9jT3fkBaD}
}

Clione is an open repository for transparent dataset sourcing, supporting responsible research in robotics and machine learning.
Our mission is to make finding and understanding datasets easy and intuitive.
