The GDELT Project is a remarkable initiative that monitors our world by analyzing global news from various sources. Here are the key aspects of the GDELT dataset:
Scope and Purpose:
- The GDELT Project aims to create a comprehensive, real-time database of global human society.
- It monitors news from broadcasts, print media, and web sources in nearly every country and over 100 languages.
- By analyzing this vast dataset, it identifies people, locations, organizations, themes, emotions, and events that shape our global society every second of every day.
Data Collection:
- GDELT continuously captures and analyzes news articles, broadcasts, and online sources.
- Its historical archives date back to January 1, 1979, and it updates every 15 minutes.
- The project goes beyond Western media, providing a more global perspective on world events and sentiments.
Features:
- GDELT uses sophisticated natural language and data mining algorithms, including powerful deep learning techniques.
- It extracts over 300 categories of events, millions of themes, thousands of emotions, and the networks connecting them.
- The dataset models human interactions at a large scale, making it valuable for research and analysis.
Vision:
- The GDELT Project envisions using this data to:
Global Reach:
- GDELT monitors media in over 100 languages across every country, providing a truly global perspective.
- It allows us to explore how social media is used worldwide and how people express themselves online.
Open Data:
- The entire GDELT database is free and open.
- Researchers can download raw data, visualize it, or analyze it at scale using tools like Google BigQuery¹²³⁴⁵.
Source: Conversation with Bing, 3/12/2024
(1) The GDELT Project. https://www.gdeltproject.org/.
(2) The GDELT Database | Aalto Datahub. https://datahub.aalto.fi/en/data-sources/the-gdelt-database.
(3) An Introduction to GDELT Data | MongoDB. https://www.mongodb.com/developer/products/mongodb/introduction-to-gdelt-data/.
(4) GDELT 2.0: Our Global World in Realtime – The GDELT Project. https://blog.gdeltproject.org/gdelt-2-0-our-global-world-in-realtime/.
(5) Data: Querying, Analyzing and Downloading: The GDELT Project. https://www.gdeltproject.org/data.html.
Variants: GDELT
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Link Prediction | SPA | Search to Pass Messages for … | 2022-10-30 |
Link Prediction | TTransE | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Link Prediction | TransE | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Link Prediction | RotateQVS | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Link Prediction | RotateQVS-Small | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Link Prediction | TeRo-Large | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Link Prediction | TeRo | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Link Prediction | DE-SimplE | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Link Prediction | DistMult | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Link Prediction | TA-DistMult | RotateQVS: Representing Temporal Information as … | 2022-03-15 |
Recent papers with results on this dataset: