DatasetDescriptions.csv
Your name; Your GitHub user; Dataset name; Dataset URL; Dataset brief description
Antoine Tribout; atribout; Amazon US Full Products List; https://www.kaggle.com/datasets/shubh932/amazon-us-full-categories-list; List of all Amazon US products with their categories, their context-free names and the number of items produced.
JingleiXu; JingleiXu97; Global Airports-Locations of airports with international travel; https://datacatalog.worldbank.org/search/dataset/0038117; Airport locations were extracted and mapped from a repository of air traffic flow, and international airports were extracted.
Yuxiao Xiong; estrellaxyx; Best Books Ever Dataset; https://zenodo.org/record/4265096#.YyNPQy1JmRs; The dataset contains 25 variables and 52478 records corresponding to books on the GoodReads Best Books Ever list (the largest list on the site). The data was retrieved in two sets, the first 30000 books and then the remaining 22478. Dates were not parsed and reformatted on the second chunk, so publishDate and firstPublishDate are represented in mm/dd/yyyy format for the first 30000 records and Month Day Year for the rest.
Stephan Wolters; sw-upm; Brain Computing Interface (BCI) Ontology; http://liveschema.eu/dataset/lov_bci; The ontology defines a minimalist and simple abstract metadata foundational model for real-world BCI applications that monitor human activity in any scenario.
Javier Mora; lndulgence; Diveboard - Scuba diving citizen science observations;https://doi.org/10.15468/tnjrgy; Dataset containing crowdsourced observations on ocean floor fauna and flora recorded by citizen scientist scuba divers
Jiayun Liu; DCY1117; Norway catch records dataset @PSNC; https://lod-cloud.net/dataset/Catch%20Record%20(2014-%202019); Catch records data from Norway (2014-2019), transformed and published as Linked Data
Yichun Sun; yichun77; Animals Characteristics of the Fuengirola bioparc in Spain; https://zenodo.org/record/6433541#.YyNubexBzAM; In this dataset we will find the family, species, order, habitat, classification, area, diet, gestation and degree of threat of each animal found in the Fuengirola Bioparc.
Nicolas Amigo Sañudo; nia9229; Football Data from Transfermarkt; https://www.kaggle.com/datasets/davidcariboo/player-scores/download?datasetVersionNumber=177; Clean, structured and automatically updated football data from Transfermarkt, including * 40,000+ games from many seasons on all major competitions * 300+ clubs from those competitions * 20,000+ players from those clubs * 900,000+ player appearance records from all games
Gabriela Argüelles Terrón; gabyarte; Linked Movie Database; https://old.datahub.io/dataset/linkedmdb; A linked-data collection of movies, actors, directors, and the relationships between them.
Alejandro Macari; macari-dev; Film Permits; https://data.cityofnewyork.us/City-Government/Film-Permits/tg4x-b46p/data ; Permits are generally required when asserting the exclusive use of city property, like a sidewalk, a street, or a park.
Guillaume Champtoussel; GuillaumeChamp; Top PC Games based on the metascore (metacritic.com); https://doi.org/10.5281/zenodo.3712225;" The dataset contains 6 columns and 199 rows of PC games with the highest metascore available on Metacritic. The title and publisher columns contain the title of the game and the name of the company that published it. The rating column contains the rating of the game, with ""-"" meaning there is no rating. The user_score column contains the average rating given by users to the game. The metascore column contains the results of the assessment of the game conducted by Metacritic critics."
Erick Cedeno;erick4556;Summer Sports Experience;https://data.cityofnewyork.us/Recreation/Summer-Sports-Experience/xeg4-ic28/data;Activity and attendance records from the Summer Sports Experience program, which provides sports instruction to children ages 8 to 14.
Anne-Fleur Kerhousse; afkerhousse; Madrid weather; https://www.kaggle.com/datasets/rober2598/madrid-weather-dataset-by-hours-20192022; The dataset contains 9 columns and 27024 rows representing weather conditions from 2019 to 2022 every hour of the day.
Ariel Ratzonel; ArielRatzonel00; New York Times - Linked Open Data; https://lod-cloud.net/dataset/nytimes-linked-open-data; The dataset contains the news vocabularies developed by the New York Times over the last 150 years, in a linked data format.
Mihai Cristian Ursa; mihai-cristian3; Nights spent at tourist accommodation establishments - monthly data; https://ec.europa.eu/eurostat/web/products-datasets/-/tour_occ_nim; The dataset contains monthly accommodation statistics from EU countries: capacity in tourist accommodation establishments (number of establishments, number of bedrooms and number of bed places) and occupancy in tourist accommodation establishments (nights spent, arrivals and occupancy rates of bed places).
Nataly Matias; NatalyMMR; Environmental pollution; https://datos.alcobendas.org/dataset/calidad-del-aire/resource/58c1f2e4-6b9c-43e2-bc43-faae4c0ec2a5; The dataset contains 18 columns of environmental pollution data. They are atmospheric measurements collected at the Alcobendas station, and the format is RDF.
Manuel Leira García-Baamonde; mlupm; Museos de la ciudad de Madrid; https://datos.madrid.es/portal/site/egob/menuitem.c05c1f754a33a9fbe4b2e4b284f1a5a0/?vgnextoid=118f2fdbecc63410VgnVCM1000000b205a0aRCRD&vgnextchannel=374512b9ace9f310VgnVCM100000171f5a0aRCRD&vgnextfmt=default; Dataset containing all the museums in the city of Madrid. The dataset is available in RDF format and is updated every 15 days.
Carlos Sánchez Velázquez; carsanvel; Telegraphis Linked Open Data - World countries; http://telegraphis.net/data/countries/; The dataset contains linked data on all the countries of the world: their capitals, currencies, land area, and more. It can be downloaded as RDF and has a SPARQL endpoint.
Ignacio Blasco; Blas47; Formula 1 racing statistics from the beginning to the present day; https://zenodo.org/record/4662943#.YygqmC8lOgR; Dataset in which we find the history of Formula 1 races from its beginnings to the present day.
Ricardo Carvalho; ricardoomrques; TMDB 5000 Movie Dataset; https://www.kaggle.com/datasets/tmdb/tmdb-movie-metadata; Metadata on ~5,000 movies from TMDb
Luis Arconada Sousa; dasousarules; Top 5000 Albums of All Time; https://www.kaggle.com/datasets/michaelbryantds/top-5000-albums-of-all-time-rateyourmusiccom; The dataset contains 5000 ranked albums including ranking, album name, artist name, release date, genres, descriptors, average rating, number of ratings, and number of reviews.
Andrei Saavedra; saavedrAndrei; Mobile apps to fight the COVID-19 crisis; https://data.jrc.ec.europa.eu/dataset/c14cb1db-c31b-4bb9-95d2-ec7148708931;" The ""Mobile apps to fight the COVID-19 crisis"" introduces the different mobile apps published around the world to fight the impact of COVID-19 crisis."
Luis Ortiz Benito; LUISOB33; Credit score classification; https://www.kaggle.com/datasets/parisrohan/credit-score-classification; The dataset classifies credits into categories according to their performance. It contains 27 columns and a total of 150000 instances (100000 already classified for training and 50000 for testing).
Maximilian Virkus; maxvirkus; Erasmus mobility statistics 2014 - 2018; https://data.europa.eu/data/datasets/erasmus-mobility-statistics-2014-2018?locale=en; This dataset contains the raw data for Erasmus+ mobility for students and staff in 2014-2018.
Paula Robles López; paumurl; Fuentes de agua de Madrid; https://datos.madrid.es/portal/site/egob/menuitem.c05c1f754a33a9fbe4b2e4b284f1a5a0/?vgnextoid=b8b2e44003b95510VgnVCM1000001d4a900aRCRD&vgnextchannel=374512b9ace9f310VgnVCM100000171f5a0aRCRD; This dataset details the water supplies in Madrid and where they can be located.
Alejandro Basco; abasco21; EVENTSKG; https://saidfathalla.github.io/EVENTS-Dataset/EVENTSKG-40.rdf; Dataset of scientific events, containing historical data about the publications, submissions, start date, end date, location and homepage for 25 top-prestigious events series.
Jorge Lizcano;Jorgelzn;7,000 Labeled Pokemon;https://www.kaggle.com/datasets/lantian773030/pokemonclassification;The dataset contains 7000 images of pokemon organised in folders and labeled by name
Andrea Pisani; andreapisa9; Wiki-MID Dataset; https://doi.org/10.6084/m9.figshare.6231326.v2; Multi-domain interests set to train and test Recommendation Systems
Alejandro de la Cruz López; acrulopez; The data.gov.au Dataset Ontology; http://linked.data.gov.au/def/dataset; Contains elements which describe the publication, update, origin, governance, spatial and temporal coverage and other contextual information about the dataset. The ontology also covers aspects of organisational custodianship and governance.
Natalia Bagnoli; NataliaBagnoli; Linked Sensor Data (Kno.e.sis); https://old.datahub.io/dataset/knoesis-linked-sensor-data; Datasets for sensors and sensor observations, created at Kno.e.sis Center, and converted from weather data at Mesowest. Contains descriptions of 20 thousand weather stations and 160 million observations.
Delia Moreno; deliamoreno2295; Libraries and bibliobuses in the city of Madrid; https://datos.madrid.es/portal/site/egob/menuitem.c05c1f754a33a9fbe4b2e4b284f1a5a0/?vgnextoid=ed35401429b83410VgnVCM1000000b205a0aRCRD&vgnextchannel=374512b9ace9f310VgnVCM100000171f5a0aRCRD&vgnextfmt=default;This dataset contains the public libraries and bibliobuses of Madrid. Additionally, you can find information on services, location and times, these data are useful for geolocation.
Maria Lara;mlaratrullenque;Madrid Salud. Estadísticas centro de protección animal; https://datos.madrid.es/portal/site/egob/menuitem.c05c1f754a33a9fbe4b2e4b284f1a5a0/?vgnextoid=6f70787df969b410VgnVCM2000000c205a0aRCRD&vgnextchannel=374512b9ace9f310VgnVCM100000171f5a0aRCRD&vgnextfmt=default ; This dataset contains the statistics of the different animals that were kept at the animal protection centre as well as the urban colonies of these animals
Zied Gobji; ZiedGOBJI; Los Angeles Parking Citations; https://www.kaggle.com/datasets/cityofLA/los-angeles-parking-citations; Parking citations with latitude / longitude (XY) in US Feet coordinates according to the NAD1983StatePlaneCaliforniaVFIPS0405_Feet projection.
Alejandro de la Cruz López; acrulopez; Manually curated transcriptomics data collection for toxicogenomic assessment of engineered nanomaterials; https://zenodo.org/record/5744003#.YyRNTexBw-Q; Toxicogenomics (TGx) approaches are increasingly applied to gain insight into the possible toxicity mechanisms of engineered nanomaterials (ENMs). Omics data can be valuable to elucidate the mechanism of action of chemicals and develop predictive models in toxicology. While vast amounts of transcriptomics data from ENM exposures have already been accumulated, a unified, easily accessible and reusable collection of transcriptomics data for ENMs is currently lacking. In an attempt to improve the FAIRness of already existing transcriptomics data for nanomaterials, we curated a collection of homogenized transcriptomics data from human, mouse and rat ENM exposures in vitro and in vivo.
Laura Artiles; laurartiles; COVID-19 Cases and Deaths by Race/Ethnicity; https://www.kaggle.com/datasets/muhakabartay/covid19-cases-and-deaths-by-raceethnicity; The following data shows the number of COVID-19 cases and associated deaths per 100,000 population by race and ethnicity among Connecticut residents
Belén Gómez; belengg27; Melbourne Airbnb Open Data; https://www.kaggle.com/datasets/tylerx/melbourne-airbnb-open-data; These datasets describe Airbnb activity in Melbourne, Australia. They include the dates each apartment is rented, the price, the reviews and the location, and can be used for many purposes, such as predicting the number of visitors Melbourne will have in a given month, how hosting prices vary over time and by location, the favourite hosting areas for visitors, or the preferred type of apartment depending on different factors.
Ian Recke; ianrecke; Accidental Drug Related Death 2012-2018; https://www.kaggle.com/datasets/muhakabartay/accidental-drug-related-deaths-20122018?select=Accidental_Drug_Related_Deaths_2012-2018.csv; Accidental death associated with drug overdose in Connecticut from 2012 to 2018.
Joao Romao; TsarkFC; Formula 1 World Championship (1950 - 2022); https://www.kaggle.com/datasets/rohanrao/formula-1-world-championship-1950-2020;F1 race data from 1950 to 2022
Jan Cerezo Pomykol; ershimen; GPU Benchmarks Compilation; https://www.kaggle.com/datasets/alanjo/gpu-benchmarks; Dataset containing GPU benchmark scores of ~2,300 models since 2009.
Alicia Garrido Vijande; aliciaagarrido;Forest Fire Statistics (Portugal); http://data.europa.eu/88u/dataset/60cb10e2078190f47a15522d?locale=en; The forest fire statistics dataset refers to the forest fire database transformed into RDF in the Cross-Forest project. For the transformation of the data into RDF, the workflow was followed where data in XLS format of the Institute of Nature and Forest Conservation (ICNF) was preprocessed for standard tabular data in CSV, then transformed into RDF.
Jorge Bolinches;JorgeOP46n2;RDF Data of Manchester City Library;http://link.manchesterlibrary.org/analytics.html;Data for Manchester City Library in Manchester, NH including their catalog, enriched with web data, along with physical locations, services, and offerings.
Gong Seong-Min; seongmingong; Sign Language MNIST; https://www.kaggle.com/datasets/datamunge/sign-language-mnist?select=sign_mnist_test; This dataset was created by benchmarking MNIST. Various sign language images can be practical to help deaf people communicate better using computer vision applications
Alvaro Montorio;alvaromontorio;Meteo France Weather;https://zenodo.org/record/5593216#.YyR5QaTP1-c;The Dataset represents in RDF the meteorological observations made by Meteo France weather stations from January to October 2021
Andrea Hetlevik Vanebo; andreavanebo; Cycling routes; https://data.europa.eu/data/datasets/ciclovie?locale=en; This is an Italian dataset of different cycling routes.
Henrik Brun Fevang; Henrbfe; Yelp Coffee Reviews; https://www.kaggle.com/datasets/sripaadsrinivasan/yelp-coffee-reviews?resource=download; Contains about 7000 reviews of different coffee shops published in Yelp. Includes a 0-5 rating, a text describing the experience for each visit.
Valentin Musat;VMusat;Cats and Dogs;https://www.kaggle.com/datasets/chetankv/dogs-cats-images;Dataset that contains 10,000 images of dogs and cats separated into training and testing sets
José Ignacio Álvarez de Miranda Rodríguez;jialvarezdemiranda;Mushroom Classification;https://www.kaggle.com/datasets/uciml/mushroom-classification; This dataset contains information about different types of mushrooms and a label that indicates whether they are edible or poisonous.
Adrián Vogel; a-vogel98; XL-Sum; https://huggingface.co/datasets/csebuetnlp/xlsum; Multilingual dataset with news articles and their respective summaries, generally used to train NLP models.
Adrián Girón Jiménez;adgiz05;Fetal Health Classification;https://www.kaggle.com/datasets/andrewmvd/fetal-health-classification;Classify fetal health in order to prevent child and maternal mortality.
Simen Kathirgamadas;Simenkathir;EU Soccer League;https://lod-cloud.net/dataset/KG:Course%20Submission;This dataset links football matches with the players who played in each match, in an RDF file.
Asís Pinedo ;Asispinedo;Deportes. Centros Deportivos Municipales (Polideportivos);https://datos.madrid.es/portal/site/egob/menuitem.c05c1f754a33a9fbe4b2e4b284f1a5a0/?vgnextoid=4a5fbef4b2503410VgnVCM2000000c205a0aRCRD&vgnextchannel=374512b9ace9f310VgnVCM100000171f5a0aRCRD&vgnextfmt=default;This dataset contains data of the location, transport, contact info, opening hours, services and available surfaces of the basic municipal sports facilities. Each of them is spatially located and referenced.
Andrés González Díaz;ofeucor;Traffic from the Port of Santander;https://data.europa.eu/data/datasets/https-www-icane-es-data-trafico-portuario-santander?locale=en;Dataset of traffic at the Port of Santander
Marta Alonso Tubía; martatubia; Household Economic Survey: Expenditure Statistics Year ended June 2019; https://www.stats.govt.nz/information-releases/household-expenditure-statistics-year-ended-june-2019; The household economic survey (HES) is an annual survey designed to measure the economic wellbeing of New Zealanders. Data from HES (Expenditure) also feeds into vital economic measures for the country, such as the consumers price index, household living-costs price indexes, and gross domestic product. Data from HES (Expenditure) is also used to measure child poverty.
César Gayo Bravo; drachodran; Avisos ciudadanos; https://datos.madrid.es/egob/catalogo/212411-31-madrid-avisa.csv; Notifications received about incidents on the streets of the city of Madrid in 2022
Neri Dervisheva;neridervishevaupm;Actividades Culturales y de Ocio Municipal en los proximos 100 dias;https://datos.madrid.es/portal/site/egob/menuitem.c05c1f754a33a9fbe4b2e4b284f1a5a0/?vgnextoid=6c0b6d01df986410VgnVCM2000000c205a0aRCRD&vgnextchannel=374512b9ace9f310VgnVCM100000171f5a0aRCRD&vgnextfmt=default;List of cultural activities, which will be held in the next 100 days and in municipal centers.
Gema Díaz Ferreiro; gemadf; Accidentes de trabajo y enfermedades profesionales; http://data.europa.eu/88u/dataset/https-www-icane-es-data-accidentes-trabajo?locale=es; Relationship between occupational accidents and their severity, gender and activity.
Victor de Dios Dominguez; victordd99; Puntos de atencion a mujeres; https://datos.madrid.es/portal/site/egob/menuitem.c05c1f754a33a9fbe4b2e4b284f1a5a0/?vgnextoid=93c1a516f8045410VgnVCM1000000b205a0aRCRD&vgnextchannel=374512b9ace9f310VgnVCM100000171f5a0aRCRD&vgnextfmt=default;The dataset contains information about the different spaces dedicated to gender equality in Madrid.
Rafael Sojo Garcia; rafasj13; Actions of the Fire Department; https://datos.madrid.es/portal/site/egob/menuitem.c05c1f754a33a9fbe4b2e4b284f1a5a0/?vgnextoid=fa677996afc6f510VgnVCM1000001d4a900aRCRD&vgnextchannel=374512b9ace9f310VgnVCM100000171f5a0aRCRD&vgnextfmt=default; This dataset collects statistical information on the actions of the Fire Department on a monthly basis, where it shows information on interventions carried out by district and by type of intervention.
Daniel Loriente; dloriente; FakeCovid- A Multilingual Cross domain Fact Check Dataset for COVID-19; https://doi.org/10.5281/zenodo.3965870; A multilingual cross-domain dataset of 7623 fact-checked news articles for COVID-19, collected in 2020.
Carlos Miguel Alonso; charly98cma; European Space Agency (ESA) Global Snow Water Equivalent Monitoring; https://arcticdata.io/catalog/view/doi%3A10.18739%2FA2CC0TV10; The dataset contains satellite-retrieved information on snow water equivalent (SWE) extending 34 years. The record on snow water equivalent is produced using a combination of passive microwave radiometer and ground-based weather station data, spanning years 1979 to 2013. The GlobSnow SWE record, based on methodology by Pulliainen (Pulliainen 2006, Takala et al. 2011) utilizes a data-assimilation based approach combining space-borne passive radiometer data (SMMR, SSM/I and SSMIS) with data from ground-based synoptic weather stations.