GoLD: Grounded Language Dataset

Unverified

November 26, 2025, 01:09 AM

Last updated

Unknown

Release date

September 28, 2020

Size

825 samples | storage size not listed

License

Unknown

Data

Modality

image

Format

PNG

WAV

TSV

Annotation

Label type

natural language, audio

Annotators

Crowdsourced from Amazon Mechanical Turk (AMT)

Description

Images are labeled with descriptions in multiple formats: text, speech (audio), and automatically recognized speech transcribed from the audio files
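As a rough illustration of how the three label formats could be tied back to each image, the sketch below groups rows of a tab-separated file by image name. The column names (`image`, `text`, `audio`, `transcription`), the file names, and the sample rows are all hypothetical and made up for this example; the actual GoLD TSV layout may differ and should be checked against the released data.

```python
import csv
import io
from collections import defaultdict

# Hypothetical TSV layout and made-up rows, for illustration only.
# The real GoLD column names and file naming may differ.
SAMPLE_TSV = (
    "image\ttext\taudio\ttranscription\n"
    "apple_1.png\tA red apple.\tapple_1_a.wav\ta red apple\n"
    "apple_1.png\tA shiny piece of fruit.\tapple_1_b.wav\ta shiny piece of fruit\n"
    "mug_2.png\tA white coffee mug.\tmug_2_a.wav\ta white coffee mug\n"
)


def group_descriptions(tsv_file):
    """Group description rows by the image file they describe."""
    reader = csv.DictReader(tsv_file, delimiter="\t")
    by_image = defaultdict(list)
    for row in reader:
        by_image[row["image"]].append(
            {
                "text": row["text"],                    # typed description
                "audio": row["audio"],                  # WAV file with the spoken description
                "transcription": row["transcription"],  # ASR output for that WAV file
            }
        )
    return dict(by_image)


# In practice, pass an open file handle instead of the in-memory sample.
descriptions = group_descriptions(io.StringIO(SAMPLE_TSV))
for image_name, labels in descriptions.items():
    print(image_name, len(labels))
```

Grouping by image keeps every description of the same picture together, which is convenient when one image has several crowdsourced descriptions.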

Source

Author

Patrick Jenkins

Rishabh Sachdeva

Gaoussou Youssouf Kebe

Padraig Higgins

Kasra Darvish

Edward Raff

Don Engel

John Winder

Francis Ferraro

Cynthia Matuszek

Institution

University of Maryland, Baltimore County

Booz Allen Hamilton

Johns Hopkins Applied Physics Laboratory

Contact

pjenk1@umbc.edu

rishabs1@umbc.edu

mb88814@umbc.edu

phiggin1@umbc.edu

kasradarvish@umbc.edu

edraff1@umbc.edu

donengel@umbc.edu

jwinder1@umbc.edu

ferraro@umbc.edu

cmat@umbc.edu

Citation

@inproceedings{kebe2021a,
  title = {A Spoken Language Dataset of Descriptions for Speech-Based Grounded Language Learning},
  author = {Gaoussou Youssouf Kebe and Padraig Higgins and Patrick Jenkins and Kasra Darvish and Rishabh Sachdeva and Ryan Barron and John Winder and Donald Engel and Edward Raff and Francis Ferraro and Cynthia Matuszek},
  booktitle = {Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1)},
  year = {2021},
  url = {https://openreview.net/forum?id=Yx9jT3fkBaD}
}

Similar datasets

SemanticKITTI

HARMONIC

UE-HRI

DROID: Distributed Robot Interaction Dataset

TartanGround

Clione
