Clione

CrowdHuman Dataset

Unverified
  • December 13, 2025, 05:22 PM
  • November 24, 2025, 11:23 PM
Last updated
Unknown
Release date
April 30, 2018
Size
24370 samples | -- GB
License
Unknown
Tags
vision
human detection

The CrowdHuman dataset is large, rich-annotated and contains high diversity. There are a total of 470K human instances from the train and validation subsets, and 22.6 persons per image, with various kinds of occlusions in the dataset. Each human instance is annotated with a head bounding-box, human visible-region bounding-box and human full-body bounding-box. The cross-dataset generalization results of CrowdHuman dataset demonstrate state-of-the-art performance on previous dataset including Caltech-USA, CityPersons, and Brainwash without bells and whistles.

Images were obtained by crawling Google's image search engine with ∼150 keywords for query. Example keywords include “Pedestrians on the Fifth Avenue”, “people crossing the roads”, “students playing basketball” and “friends at a party”. These keywords cover more than 40 different cities around the world, various activities (e.g., party, traveling, and sports), and numerous viewpoints (e.g., surveillance viewpoint and horizontal viewpoint). The number of images crawled from a keyword is limited to 500 to make the distribution of images balanced. The authors crawled ∼60, 000 candidate images in total. The images with only a small number of persons, or with small overlaps between persons, are filtered. Of the total number, ∼25,000 final images were included in the CrowdHuman dataset.

CrowdHuman Dataset

Modality
image
Format
Unknown
Annotation
Label type bounding box
Source
Author
Shuai Shao
Zijian Zhao
Boxun Li
Tete Xiao
Gang Yu
Xiangyu Zhang
Jian Sun
Institution
Megvii Inc.
Contact
shaoshuai@megvii.com
zhaozijian@megvii.com
liboxun@megvii.com
xtt@megvii.com
yugang@megvii.com
zhangxiangyu@megvii.com
sunjian@megvii.com

Citation

            @article{shao2018crowdhuman,
  author = {Shao, Shuai and Zhao, Zijian and Li, Boxun and Xiao, Tete and Yu, Gang and Zhang, Xiangyu and Sun, Jian},
  journal = {arXiv preprint arXiv:1805.00123},
  title = {CrowdHuman: A Benchmark for Detecting Human in a Crowd},
  year = {2018}
}
        

Example usage

Similar datasets



Clione is an open repository for transparent dataset sourcing, supporting responsible research in robotics and machine learning.
Our mission is to make finding and understanding datasets easy and intutive.

About FAQs Contact