Resource Guide for AI for Connectivity Hackathon

This resource guide is designed to help you minimize the time spent searching for relevant datasets, tools, and collections, allowing you to focus on what matters most: building innovative solutions. It includes a carefully curated selection of resources that align with a wide variety of goals, ranging from analyzing connectivity gaps to optimizing networks and exploring cutting-edge technologies.

While this guide offers a broad spectrum of solutions, the intention is not to overwhelm but to provide options tailored to your needs. Each resource has been selected to address different aspects of the network lifecycle, covering design, deployment, monitoring, and optimization. It also incorporates technologies beyond the mainstream, encouraging creativity and diversity in your approach.

Whether you're tackling rural connectivity, improving network infrastructure, or innovating with advanced 5G technologies, this guide equips you with the tools and datasets necessary to achieve impactful results.

Take your time exploring the resources, and choose the ones that best support your objectives. With this guide, you'll have access to a wealth of knowledge and technology to bring your ideas to life. Good luck, and happy building!

Table of Contents

  1. Datasets
  2. Simulators and Tools
  3. Collections and Resources

1. Datasets

Connectivity and Network Infrastructure

1.1 Giga School Mapping Data

  • Description: School geolocation and connectivity data, drawn from sources such as OpenStreetMap and government datasets.
  • Use: Analyze connectivity gaps and prioritize infrastructure investments in education.
  • Source: Giga
  • Documentation Link: Giga School Mapping
  • Data Access Link: Data Access

1.2 Giga School Connectivity Data

  • Description: School connectivity data from Giga’s Daily Check App, installed in schools across 21+ countries.
  • Use: Analyze connectivity patterns in end-user devices.
  • Source: Giga
  • Documentation Link: Giga School Connectivity Data
  • Data Access Link: Data Access

1.3 Broadband Maps

  • Description: Displays global broadband infrastructure and transmission networks based on ITU research.
  • Source: ITU
  • Link: Broadband Maps

1.4 OpenCellID

  • Description: Offers global cell tower location data categorized by radio type, such as 4G and 5G.
  • Source: OpenCellID
  • Link: OpenCellID
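
One way to combine school locations (1.1) with cell tower locations is to compute each school's distance to its nearest tower. The sketch below is a minimal illustration, assuming both datasets have already been reduced to (latitude, longitude) pairs in decimal degrees. The function names and sample coordinates are hypothetical and are not taken from either dataset.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_tower_km(school, towers):
    """Distance from one (lat, lon) school to the closest (lat, lon) tower."""
    return min(haversine_km(school[0], school[1], t[0], t[1]) for t in towers)


# Hypothetical sample points, for illustration only.
schools = [(-1.95, 30.06), (-2.60, 29.74)]
towers = [(-1.94, 30.05), (-1.50, 29.63)]
distances = [nearest_tower_km(s, towers) for s in schools]
```

This brute-force search is adequate for a few thousand points; for country-scale data a spatial index would likely be a better fit.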

1.5 Cloudflare Radar

  • Description: Provides global internet traffic patterns, DNS data, attack activity, and network trends.
  • Source: Cloudflare
  • Link: Cloudflare Radar

1.6 Internet Exchange Map

  • Description: Global map of internet exchange points, including IXPs’ architecture and utilization.
  • Source: TeleGeography
  • Link: Internet Exchange Map

1.7 Submarine Cable Map

  • Description: Visualizes submarine cable networks worldwide; the map is maintained by TeleGeography.
  • Source: TeleGeography
  • Link: Submarine Cable Map

Geospatial Data

2.1 Sentinel-2

  • Description: Provides multispectral imagery for applications like land cover classification.
  • Source: European Space Agency
  • Link: Sentinel-2 Data

2.2 VIIRS Nighttime Lights

  • Description: Measures nighttime light intensity, which is commonly used as a proxy for population density and economic activity.
  • Source: NASA
  • Link: VIIRS Lights

2.3 Global Human Settlement Layer

  • Description: Offers data on built-up surfaces, land use, and population density.
  • Source: European Commission, Joint Research Centre (JRC)
  • Link: GHSL Data

2.4 Google Open Buildings

  • Description: Building outlines for regions like Africa, South Asia, and the Caribbean, created from satellite imagery.
  • Source: Google Research
  • Link: Google Open Buildings

2.5 Google Earth Engine

  • Description: Provides access to a multi-petabyte catalog of satellite imagery and geospatial datasets for detecting changes, mapping trends, and quantifying differences on the Earth’s surface.
  • Source: Google for Developers
  • Link: Earth Engine Data Catalog

2.6 Overture Maps POI Data

  • Description: Global point-of-interest (POI) data for applications like urban planning and logistics.
  • Source: Overture Maps Foundation
  • Link: POIs Data

2.7 Global Electricity Grid

  • Description: Derived global map of electricity transmission and distribution lines, produced by the World Bank using the Gridfinder model.
  • Source: World Bank
  • Link: World Bank Data Catalog

2.8 Microsoft Global Building Footprints

  • Description: Worldwide building footprint data derived from satellite imagery.
  • Source: Microsoft
  • Link: Microsoft Building Footprints

Internet Measurement Data

3.1 OONI Explorer

  • Description: Open data on internet censorship worldwide, contributed by OONI Probe users.
  • Source: OONI
  • Link: OONI Explorer

3.2 Ookla Speedtest

  • Description: Global dataset of download speed, upload speed, and latency measurements for fixed broadband and mobile networks.
  • Source: Ookla
  • Link: Ookla Speedtest Data
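
When combining speed measurements from several areas, it usually makes sense to weight each area by its number of tests, so that sparsely tested areas do not count as much as heavily tested ones. The sketch below is a minimal illustration. The field names (`avg_d_kbps`, `tests`) are assumed to match Ookla's open data files and should be checked against the actual download; the sample records are made up.

```python
def weighted_mean(records, value_key, weight_key):
    """Mean of records[value_key], weighted by records[weight_key]."""
    total_weight = sum(r[weight_key] for r in records)
    if total_weight == 0:
        raise ValueError("total weight is zero; nothing to average")
    return sum(r[value_key] * r[weight_key] for r in records) / total_weight


# Hypothetical records: one busy, fast area and one quiet, slow area.
tiles = [
    {"avg_d_kbps": 20000, "tests": 90},
    {"avg_d_kbps": 2000, "tests": 10},
]
mean_download_kbps = weighted_mean(tiles, "avg_d_kbps", "tests")
```

An unweighted mean of the two sample records would be 11000 kbps; weighting by test count moves the result toward the area where most measurements were taken.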

3.3 Internet Outage Detection and Analysis (IODA)

  • Description: Monitors global internet outages and provides detailed analysis through APIs and tools.
  • Source: IODA
  • Link: IODA API

Socioeconomic and Demographic Data

4.1 Subnational Human Development Index (HDI)

  • Description: Provides health, education, and income data at subnational levels.
  • Source: UNDP
  • Link: HDI Data

4.2 Relative Wealth Index

  • Description: Uses satellite imagery and de-identified Facebook data to estimate living standards within countries.
  • Source: Facebook Research
  • Link: Relative Wealth Index

Point of Interest (POI) Data

5.1 OpenStreetMap

  • Description: Community-driven geospatial data covering roads, buildings, and POIs.
  • Source: OpenStreetMap Foundation
  • Link: OpenStreetMap

5.2 Health Center Location Data

  • Description: Online map of global health facilities, including precise location data.
  • Source: HealthSites
  • Link: Health Center Locations

Additional Network Modeling and Simulation Datasets

6.1 Network Modeling Datasets

  • Description: Datasets generated with the OMNeT++ simulator to train and validate machine learning models for network optimization and KPI prediction.
    • Includes per-source/destination metrics like delay, jitter, and loss.
  • Source: BNN-UPC
  • Link: Network Modeling Datasets
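
Per-source/destination metrics can be held in a mapping keyed by (source, destination) pairs, which makes simple summaries easy to compute before any model training. The sketch below is a minimal illustration; the structure, field names, and values are hypothetical and do not reflect the actual file format of these datasets.

```python
# Hypothetical per-path KPIs keyed by (source, destination) node IDs.
path_metrics = {
    (0, 1): {"delay": 0.012, "jitter": 0.002, "loss": 0.00},
    (0, 2): {"delay": 0.045, "jitter": 0.010, "loss": 0.03},
    (1, 2): {"delay": 0.020, "jitter": 0.004, "loss": 0.01},
}


def mean_metric(metrics, name):
    """Average of one KPI across all source/destination pairs."""
    return sum(m[name] for m in metrics.values()) / len(metrics)


def worst_path(metrics, name):
    """The (source, destination) pair with the highest value of one KPI."""
    return max(metrics, key=lambda pair: metrics[pair][name])


avg_delay = mean_metric(path_metrics, "delay")
slowest = worst_path(path_metrics, "delay")
```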

6.2 Infinite Dataset Hub - 5GNetworkOptimization

  • Description: AI-generated dataset focusing on 5G optimization strategies, including resource allocation, throughput maximization, and load balancing.
  • Source: Hugging Face
  • Link: 5GNetworkOptimization

2. Simulators and Tools

BNNetSimulator


Hugging Face Infinite Dataset Hub

Overview

The Hugging Face Infinite Dataset Hub is an experimental platform offering AI-generated datasets tailored to various applications. This resource provides dynamic datasets suitable for research and development in domains like telecom, infrastructure planning, and network optimization. The datasets are particularly useful for hackathon participants looking to quickly prototype and test AI models.

Examples of Relevant Datasets That Could Be Generated

  1. 5GNetworkOptimization

    • Tags: Resource Allocation, Throughput Maximization, Load Balancing
    • Description: Focused on optimizing 5G network performance through strategies like resource allocation and load balancing. Ideal for projects involving network traffic management or improving Quality of Service (QoS) in underserved areas.
    • Link: 5GNetworkOptimization
  2. NetworkInfrastructureInsights

    • Tags: Network Mapping, Infrastructure Gaps, Connectivity Planning
    • Description: Provides data simulating network infrastructure insights, such as signal coverage, tower density, and connectivity gaps. This dataset is useful for identifying areas needing infrastructure upgrades, particularly in rural or underserved locations.
  3. BroadbandAccessForecast

    • Tags: Connectivity, Predictive Analysis, Internet Access
    • Description: AI-generated dataset forecasting broadband access trends over time, including data on adoption rates, usage patterns, and geographic coverage. Could help in planning connectivity initiatives or evaluating policy impacts.
  4. LatencyOptimization

    • Tags: Network Performance, Delay Reduction, Service Improvement
    • Description: Contains simulated latency metrics for various network configurations, focusing on reducing delay in high-traffic scenarios. Useful for hackathon teams working on real-time applications or improving user experience in dense networks.

Note

All datasets in the Infinite Dataset Hub are AI-generated and may contain inaccuracies. They are best suited for prototyping and proof-of-concept projects. Feedback on the platform is encouraged to improve its offerings.


OsmoBTS

  • Description: Open-source implementation of a GSM Base Transceiver Station (BTS). It is a key component in deploying 2G networks, especially in remote areas where modern networks are impractical.
  • Link: OsmoBTS

OpenBTS

  • Description: A software-defined radio implementation of a GSM BTS, enabling the deployment of GSM networks. Ideal for creating low-cost, small-scale cellular networks.
  • Link: OpenBTS

Magma

  • Description: A flexible, open-source core network platform that supports LTE, 5G, and Wi-Fi access. Designed to bring connectivity to rural and underserved regions with a focus on scalability and cost-efficiency.
  • Link: Magma

LibreRouter

  • Description: An open-source hardware and software solution for community-owned networks. It provides a framework for deploying resilient and cost-effective wireless networks.
  • Link: LibreRouter

Open5GS

  • Description: An open-source project implementing both 5G Core Network (5GC) and 4G Evolved Packet Core (EPC).
  • Link: Open5GS

3. Collections and Resources

Awesome-Telco

  • Description: A curated and comprehensive list of telecom-related tools, resources, and projects that span multiple aspects of telecommunication technologies. This repository is designed for researchers, developers, and enthusiasts working on telco systems and includes a rich variety of tools and frameworks for:
    • SIM Management:
      • Tools like PySIM for programming and managing SIM cards.
      • Utilities for exploring and testing programmable SIM/USIM/ISIM cards.
      • Projects for implementing and testing eSIM and UICC technologies.
    • Network Infrastructure:
      • Open-source projects for building and managing LTE and 5G networks, such as Open5GS and srsLTE.
      • Tools for deploying and simulating Radio Access Networks (RAN) and Core Networks (CN).
      • Resources for network automation, virtualization (NFV), and orchestration using platforms like Kubernetes.
    • Testing and Debugging:
      • Tools like SigPloit for telecom security testing.
      • Frameworks for diagnosing and simulating network behavior, such as S1APTester and LTE-CellScanner.
      • Resources for evaluating protocols, such as GTP, SCTP, and Diameter.
    • Security:
      • Projects focused on detecting and mitigating vulnerabilities in telecom systems, including SS7 and 5G networks.
      • Tools for signal interception and security research.
    • Documentation and Learning:
      • References to key standards, specifications, and tutorials for telecom professionals.
  • Link: Awesome-Telco

Awesome-Connectivity

  • Description: A detailed repository maintained by Steve Song, focusing on connectivity-related resources. This collection provides insights into the infrastructure, access, and connectivity status across different regions, helping users understand who has access to communication networks and on what terms. Key highlights include:
    • Connectivity Maps:
      • Resources such as submarine cable maps, internet exchange maps, and regional broadband infrastructure maps.
      • Tools and datasets for visualizing global and regional connectivity patterns.
    • Indexes and Reports:
      • Lists of global and regional connectivity indexes like bandwidth availability, pricing, and performance metrics.
      • Reports that analyze trends in internet penetration, network resilience, and infrastructure investments.
    • Policy and Advocacy:
      • Resources for understanding policy decisions affecting connectivity, including open data on regulations and their impact on telecom access.
    • Technical Data:
      • Detailed datasets for analyzing network coverage, backbone infrastructure, and interconnection points.
  • Link: Awesome-Connectivity