Analytics Toolbox for Databricks

The CARTO Analytics Toolbox for Databricks is designed to bring advanced geospatial analytics to Databricks users by building on top of both Apache Sedona and native Databricks Spatial SQL.

This integration equips data engineers, data scientists, and analysts with robust geospatial tools tailored for Databricks’ unique ecosystem, optimizing workflows and expanding the scope of spatial analysis.

Check out Databricks’ announcement on Spatial SQL in this video.

The CARTO Analytics Toolbox requires Databricks Spatial SQL, currently in private preview.

Access to the preview functions can be requested through this form.

The CARTO Analytics Toolbox extends geospatial functionalities by adding support for:

  • Tileset creation for vector and spatial index data sources.

  • Raster fast data access and intersections with vector tables through SQL .

  • Location Data Services for geocoding, isoline generation and routing right from Databricks, leveraging top-class third party services.

  • Advanced geospatial statistics like Composite Scores, Moran's I, Local Moran's I, Hotspots Analysis or geographically weighted regressions.

  • Industry-specific capabilities, like cell towers' signal propagation models in mobile networks for the Telco industry.

Why the CARTO Analytics Toolbox?

For organizations handling spatial data, CARTO’s Analytics Toolbox provides a streamlined, Databricks-native solution that surpasses what is achievable with open-source tools like Apache Sedona or Databricks Mosaic alone.

While Sedona offers foundational geospatial functionality and Mosaic provides more specific functionality, the CARTO Analytics Toolbox offers:

  • Easy setup on your Databricks all-purpose compute clusters, both shared and single-user.

  • Supports latest Databricks product features: like DBR 15.x, or Photon acceleration.

  • Continuous improvements and support from the CARTO team, officially backed by Databricks.

Getting access

The Analytics Toolbox is distributed as a JAR package ready to be installed in your Databricks clusters. Take a look at the Getting access section to learn about different installation options.

Use the Analytics Toolbox

Visit the Reference to see the full list of available functions and check our Guides to learn more about different use cases.

Last updated