Accessing DE Africa Data and Functionality via Cloud Computing Platforms

Directly accessing Digital Earth Africa (DE Africa) data in an environment of choice provides significant advantages, including the flexibility to scale compute resources in the cloud and the ability to integrate DE Africa datasets with user-generated or third-party data. Beyond the DE Africa Sandbox, two widely adopted platforms—Google Colab and Amazon SageMaker Studio Lab—offer accessible options for running DE Africa workflows.

Google Colab is a free, browser-based environment that comes preloaded with popular data science and geospatial libraries, GPU/TPU acceleration, and seamless integration with Google Drive and GitHub. Amazon SageMaker Studio Lab, in contrast, provides a JupyterLab-style interface with persistent storage, free CPU/GPU resources, and integration with AWS services, with the option to scale into full Amazon SageMaker for enterprise-level applications. By linking DE Africa’s Direct Access to data through these platforms, users can load and analyze Analysis Ready Data (ARD) products—such as Fractional Cover or Water Observations from Space—and fuse them with other datasets, enabling reproducible, large-scale analyses that are both collaborative and cloud-optimized.

⚠️ Note: Both Colab and SageMaker have free tiers that are sufficient for running most notebooks. However, for larger applications and upscaling, users may consider purchasing larger compute instances through commercial cloud providers


Setting Up DE Africa Direct Access

To make it easier for users to start working with Digital Earth Africa datasets outside the Sandbox, a ready-to-use tutorial and demonstration notebook have been prepared. These resources guide you through the setup process and show how to load and analyze data using Direct Access packages.

Steps to Get Started

  1. Open the GitHub page
  2. Select the demonstration notebook
    Open the notebook file DEAfrica_loading_demonstration.ipynb.
  3. Launch in a cloud environment
    Use the “Open in Colab” button (or copy the notebook into Amazon SageMaker Studio Lab).
  4. Install the required packages
    Follow the setup cells in the notebook to install and configure the DE Africa Direct Access packages.
  5. Run the examples
    Execute the notebook cells to load and visualize Digital Earth Africa datasets such as USGS Landsat Collection 2 Level-2 Surface Reflectance, Water Observations, or GeoMAD composites.
  6. Continue with more notebooks
    Once you confirm the demonstration notebook runs successfully, explore the other notebooks provided in:

    Recommended order:

    1. Start with the Datasets notebooks to practice loading and working with different Digital Earth Africa products.

    2. Move on to the Real World Examples to see how the data can be applied in practical analysis.

    3. Finally, explore the Use Cases to understand how DE Africa data and tools support decision-making across sectors.

    These additional notebooks also run on the online platforms (Colab, SageMaker) and will help you become more familiar with how Direct Access works in practice.

Accessing DE Africa Tools Outside the Sandbox

Apart from having direct access to Digital Earth Africa datasets, many of the tools and functions available in the DE Africa Sandbox can also be used in external cloud environments. This means you are not limited to the Sandbox — the same functions can be imported and run in Google Colab, Amazon SageMaker Studio Lab, or other Jupyter-based environments.

Examples of available functions include:

  • deafrica_display_map for interactive map visualization.

  • deafrica_rgb plotting functions for quick rendering of satellite imagery.

  • calculate_indices for computing vegetation, water, and other spectral indices.

  • xr_rasterize for converting vector data to raster grids.

As demonstrated in the loading notebook — and other example notebooks — these functions work seamlessly once the required packages are installed, making it possible to replicate Sandbox workflows in any supported environment.


⚠️ Note: Cloud computing platforms such as Google Colab and SageMaker Studio Lab have their own core-based infrastructure rather than a native ODC environment.

  • Functions that depend heavily on ODC (e.g., dc.load and load_ard) may not work as expected.

  • However, most Sandbox-supported functions that rely on widely used Python libraries such as xarray, geopandas, and scipy are fully supported and can be executed without issue.


Conclusion

Using DE Africa Direct Access with cloud platforms like Google Colab and Amazon SageMaker Studio Lab gives users flexibility beyond the Sandbox. While some ODC-dependent workflows may not run natively, the majority of DE Africa tools and functions can be used effectively. This makes it possible to build reproducible, scalable, and collaborative workflows that integrate DE Africa datasets with other data sources, opening up broader opportunities for research, innovation, and decision-making.


💡 Feedback

If you have any feedback on DE Africa data and services, please contact us at:

🚀 Looking Ahead

The DE Africa team will continue to improve these tools and services to make it easier for users to flexibly access, analyze, and integrate data in the environments of their choice.