5 Weirdest Data Sets You Never Knew Existed!


A data set is a collection of related, discrete items of data that may be accessed individually, used in combination, or managed as a whole.

A dataset, or data set, is simply a collection of data. The simplest and most common format for datasets you’ll find online is a spreadsheet or CSV format — a single file organized as a table of rows and columns. But some datasets will be stored in other formats, and they don’t have to be just one file.
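To make the "table of rows and columns" idea concrete, here is a minimal Python sketch that reads a small CSV with the standard library. The sample data and the column names (city, year, sightings) are invented for illustration and do not come from any of the data sets below.

```python
import csv
import io

# Hypothetical sample data; the column names are made up for illustration.
SAMPLE_CSV = """city,year,sightings
Phoenix,1997,12
Roswell,1947,1
"""

def load_rows(text):
    """Parse CSV text into a list of dicts, one per row, keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))

rows = load_rows(SAMPLE_CSV)
for row in rows:
    # Every value arrives as a string; convert with int() or float() as needed.
    print(row["city"], row["year"], row["sightings"])
```

The first line of the file supplies the column names, and every later line becomes one row. For a real file on disk, you would pass an opened file object to csv.DictReader instead of the in-memory string used here.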


A dataset in machine learning is a collection of data points that a computer can treat as a single unit for analysis and prediction. This means the collected data should be made uniform and understandable to a machine, which doesn’t see data the same way humans do.

Without further ado, here are the 5 weirdest data sets.

5 Weirdest Data Sets 

1. UFO Reports

The search for intelligent life in the universe gets a big upgrade with the ufo-reports data set, which tracks over 80,000 sightings from the National UFO Reporting Center over the last century. The data includes geolocation and standardized timestamps, which makes it easy for anyone studying extraterrestrial contact to compare sightings.

2. List of cats in movies

Felines in films have been around since 1903, according to this data set, which was compiled on OpenDataSoft, a portal for over 13,000 public datasets. This list can be sorted by director, producer, and year, and can be used to find out which decade was the most feline-friendly in film.
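As a rough sketch of how the "most feline-friendly decade" question could be answered, the Python snippet below counts films per decade. The records and the field names (title, year) are placeholders, not the actual schema or contents of the data set.

```python
from collections import Counter

# Hypothetical records; the real data set's field names and values may differ.
films = [
    {"title": "Film A", "year": 1903},
    {"title": "Film B", "year": 1961},
    {"title": "Film C", "year": 1965},
    {"title": "Film D", "year": 1979},
]

def films_per_decade(records):
    """Count records per decade, e.g. 1965 is counted under 1960."""
    return Counter((r["year"] // 10) * 10 for r in records)

counts = films_per_decade(films)
top_decade, top_count = counts.most_common(1)[0]
print(f"{top_decade}s: {top_count} films")
```

Integer division by 10 drops the last digit of the year, so every film from 1960 to 1969 lands in the same bucket.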

3. Registered meteorites that have impacted Earth

In 2013, a meteorite fell in the Ural Mountains in Russia, injuring about 1,500 people. Inspired by this, Ramon Martinez of publichealthintelligence.org created this data set of registered meteorites. The information is based on every meteorite recorded in the Meteoritical Society database. Those using the data can now determine how often an area has been hit and the size of the meteorite that hit it, possibly foreseeing what could be coming if the data has predictive value.
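One way to ask "how often has this area been hit, and by how large a meteorite?" is to group records into grid cells. The Python sketch below assumes each record is a (latitude, longitude, mass in grams) tuple. That layout and the sample values are invented for illustration, and the real data set may be structured differently.

```python
import math
from collections import defaultdict

# Hypothetical records: (latitude, longitude, mass in grams). Not real data.
meteorites = [
    (54.8, 61.1, 100000.0),
    (54.2, 61.9, 250.0),
    (35.0, -111.0, 30000.0),
]

def summarize_by_cell(records):
    """Group records into 1-degree grid cells.

    Returns {(lat_floor, lon_floor): (count, largest_mass)}.
    """
    cells = defaultdict(lambda: [0, 0.0])
    for lat, lon, mass in records:
        cell = (math.floor(lat), math.floor(lon))
        cells[cell][0] += 1
        cells[cell][1] = max(cells[cell][1], mass)
    return {cell: (count, largest) for cell, (count, largest) in cells.items()}

summary = summarize_by_cell(meteorites)
for cell, (count, largest) in sorted(summary.items()):
    print(cell, count, largest)
```

A tally like this only describes past recorded finds. Whether it says anything about future impacts is a separate question that the counts alone cannot settle.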

4. Abandoned Shopping Trolleys in Bristol Rivers

The Bristol City Council in England created this hyper-specific data set to identify the locations of abandoned shopping carts in the rivers of their fair city. While the relevance is limited to the citizens of Bristol, it certainly helps those wishing to round up the abandoned carts for an impromptu shopping trip.

5. Last words of Texas death row inmates

While it may seem morbid, it’s also quite fascinating to examine this data set of the last words of inmates who were executed by the State of Texas. The data set includes information about the inmates and links to their last statements. While the data may not be as practical as some other sets on this list, the historical value of the information cannot be ignored.

I hope you found this article both entertaining and eye-opening.

