LogoLogo
HomeAcademyLoginTry for free
  • Welcome
  • What's new
    • Q2 2025
    • Q1 2025
    • Q4 2024
    • Q3 2024
    • Q2 2024
    • Q1 2024
    • Q4 2023
    • Q3 2023
    • Q2 2023
    • Q1 2023
    • Q4 2022
    • Q3 2022
  • FAQs
    • Accounts
    • Migration to the new platform
    • User & organization setup
    • General
    • Builder
    • Workflows
    • Data Observatory
    • Analytics Toolbox
    • Development Tools
    • Deployment Options
    • CARTO Basemaps
    • CARTO for Education
    • Support Packages
    • Security and Compliance
  • Getting started
    • What is CARTO?
    • Quickstart guides
      • Connecting to your data
      • Creating your first map
      • Creating your first workflow
      • Developing your first application
    • CARTO Academy
  • CARTO User Manual
    • Overview
      • Creating your CARTO organization
      • CARTO Cloud Regions
      • CARTO Workspace overview
    • Maps
      • Data sources
        • Simple features
        • Spatial Indexes
        • Pre-generated tilesets
        • Rasters
        • Defining source spatial data
        • Managing data freshness
        • Changing data source location
      • Layers
        • Point
          • Grid point aggregation
          • H3 point aggregation
          • Heatmap point aggregation
          • Cluster point aggregation
        • Polygon
        • Line
        • Grid
        • H3
        • Raster
        • Zoom to layer
      • Widgets
        • Formula widget
        • Category widget
        • Pie widget
        • Histogram widget
        • Range widget
        • Time Series widget
        • Table widget
      • SQL Parameters
        • Date parameter
        • Text parameter
        • Numeric parameter
        • Publishing SQL parameters
      • Interactions
      • Legend
      • Basemaps
        • Basemap selector
      • AI Agents
      • SQL analyses
      • Map view modes
      • Map description
      • Feature selection tool
      • Search locations
      • Measure distances
      • Exporting data
      • Download PDF reports
      • Managing maps
      • Sharing and collaboration
        • Editor collaboration
        • Map preview for editors
        • Map settings for viewers
        • Comments
        • Embedding maps
        • URL parameters
      • Performance considerations
    • Workflows
      • Workflow canvas
      • Results panel
      • Components
        • Aggregation
        • Custom
        • Data Enrichment
        • Data Preparation
        • Generative AI
        • Input / Output
        • Joins
        • Parsers
        • Raster Operations
        • Spatial Accessors
        • Spatial Analysis
        • Spatial Constructors
        • Spatial Indexes
        • Spatial Operations
        • Statistics
        • Tileset Creation
        • BigQuery ML
        • Snowflake ML
        • Google Earth Engine
        • Google Environment APIs
        • Telco Signal Propagation Models
      • Data Sources
      • Scheduling workflows
      • Sharing workflows
      • Using variables in workflows
      • Executing workflows via API
      • Temporary data in Workflows
      • Extension Packages
      • Managing workflows
      • Workflows best practices
    • Data Explorer
      • Creating a map from your data
      • Importing data
        • Importing rasters
      • Geocoding data
      • Optimizing your data
    • Data Observatory
      • Terminology
      • Browsing the Spatial Data Catalog
      • Subscribing to public and premium datasets
      • Accessing free data samples
      • Managing your subscriptions
      • Accessing your subscriptions from your data warehouse
        • Access data in BigQuery
        • Access data in Snowflake
        • Access data in Databricks
        • Access data in Redshift
        • Access data in PostgreSQL
    • Connections
      • Google BigQuery
      • Snowflake
      • Databricks
      • Amazon Redshift
      • PostgreSQL
      • CARTO Data Warehouse
      • Sharing connections
      • Deleting a connection
      • Required permissions
      • IP whitelisting
      • Customer data responsibilities
    • Applications
    • Settings
      • Understanding your organization quotas
      • Activity Data
        • Activity Data Reference
        • Activity Data Examples
        • Activity Data Changelog
      • Users and Groups
        • Inviting users to your organization
        • Managing user roles
        • Deleting users
        • SSO
        • Groups
        • Mapping groups to user roles
      • CARTO Support Access
      • Customizations
        • Customizing appearance and branding
        • Configuring custom color palettes
        • Configuring your organization basemaps
        • Enabling AI Agents
      • Advanced Settings
        • Managing applications
        • Configuring S3 Bucket for Redshift Imports
        • Configuring OAuth connections to Snowflake
        • Configuring OAuth U2M connections to Databricks
        • Configuring S3 Bucket integration for RDS for PostgreSQL Exports in Builder
        • Configuring Workload Identity Federation for BigQuery
      • Data Observatory
      • Deleting your organization
    • Developers
      • Managing Credentials
        • API Base URL
        • API Access Tokens
        • SPA OAuth Clients
        • M2M OAuth Clients
      • Named Sources
  • Data and Analysis
    • Analytics Toolbox Overview
    • Analytics Toolbox for BigQuery
      • Getting access
        • Projects maintained by CARTO in different BigQuery regions
        • Manual installation in your own project
        • Installation in a Google Cloud VPC
        • Core module
      • Key concepts
        • Tilesets
        • Spatial indexes
      • SQL Reference
        • accessors
        • clustering
        • constructors
        • cpg
        • data
        • http_request
        • import
        • geohash
        • h3
        • lds
        • measurements
        • placekey
        • processing
        • quadbin
        • random
        • raster
        • retail
        • routing
        • s2
        • statistics
        • telco
        • tiler
        • transformations
      • Guides
        • Running queries from Builder
        • Working with Raster data
      • Release notes
      • About Analytics Toolbox regions
    • Analytics Toolbox for Snowflake
      • Getting access
        • Native App from Snowflake's Marketplace
        • Manual installation
      • Key concepts
        • Spatial indexes
        • Tilesets
      • SQL Reference
        • accessors
        • clustering
        • constructors
        • data
        • http_request
        • import
        • h3
        • lds
        • measurements
        • placekey
        • processing
        • quadbin
        • random
        • raster
        • retail
        • s2
        • statistics
        • tiler
        • transformations
      • Guides
        • Running queries from Builder
        • Working with Raster data
      • Release Notes
    • Analytics Toolbox for Databricks
      • Getting access
        • Personal (former Single User) cluster
        • Standard (former Shared) cluster
      • Reference
        • lds
        • tiler
      • Guides
      • Release Notes
    • Analytics Toolbox for Redshift
      • Getting access
        • Manual installation in your database
        • Installation in an Amazon Web Services VPC
        • Core version
      • Key concepts
        • Tilesets
        • Spatial indexes
      • SQL Reference
        • clustering
        • constructors
        • data
        • http_request
        • import
        • lds
        • placekey
        • processing
        • quadbin
        • random
        • s2
        • statistics
        • tiler
        • transformations
      • Guides
        • Running queries from Builder
      • Release Notes
    • Analytics Toolbox for PostgreSQL
      • Getting access
        • Manual installation
        • Core version
      • Key concepts
        • Tilesets
        • Spatial Indexes
      • SQL Reference
        • h3
        • quadbin
        • tiler
      • Guides
        • Creating spatial index tilesets
        • Running queries from Builder
      • Release Notes
    • CARTO + Python
      • Installation
      • Authentication Methods
      • Visualizing Data
      • Working with Data
        • How to work with your data in the CARTO Data Warehouse
        • How to access your Data Observatory subscriptions
        • How to access CARTO's Analytics Toolbox for BigQuery and create visualizations via Python notebooks
        • How to access CARTO’s Analytics Toolbox for Snowflake and create visualizations via Python notebooks
        • How to visualize data from Databricks
      • Reference
    • CARTO QGIS Plugin
  • CARTO for Developers
    • Overview
    • Key concepts
      • Architecture
      • Libraries and APIs
      • Authentication methods
        • API Access Tokens
        • OAuth Access Tokens
        • OAuth Clients
      • Connections
      • Data sources
      • Visualization with deck.gl
        • Basemaps
          • CARTO Basemap
          • Google Maps
            • Examples
              • Gallery
              • Getting Started
              • Basic Examples
                • Hello World
                • BigQuery Tileset Layer
                • Data Observatory Tileset Layer
              • Advanced Examples
                • Arc Layer
                • Extrusion
                • Trips Layer
            • What's New
          • Amazon Location
            • Examples
              • Hello World
              • CartoLayer
            • What's New
        • Rapid Map Prototyping
      • Charts and widgets
      • Filtering and interactivity
      • Summary
    • Quickstart
      • Make your first API call
      • Visualize your first dataset
      • Create your first widget
    • Guides
      • Build a public application
      • Build a private application
      • Build a private application using SSO
      • Visualize massive datasets
      • Integrate CARTO in your existing application
      • Use Boundaries in your application
      • Avoid exposing SQL queries with Named Sources
      • Managing cache in your CARTO applications
    • Reference
      • Deck (@deck.gl reference)
      • Data Sources
        • vectorTableSource
        • vectorQuerySource
        • vectorTilesetSource
        • h3TableSource
        • h3QuerySource
        • h3TilesetSource
        • quadbinTableSource
        • quadbinQuerySource
        • quadbinTilesetSource
        • rasterSource
        • boundaryTableSource
        • boundaryQuerySource
      • Layers (@deck.gl/carto)
      • Widgets
        • Data Sources
        • Server-side vs. client-side
        • Models
          • getFormula
          • getCategories
          • getHistogram
          • getRange
          • getScatter
          • getTimeSeries
          • getTable
      • Filters
        • Column filters
        • Spatial filters
      • CARTO APIs Reference
    • Release Notes
    • Examples
    • CARTO for React
      • Guides
        • Getting Started
        • Views
        • Data Sources
        • Layers
        • Widgets
        • Authentication and Authorization
        • Basemaps
        • Look and Feel
        • Query Parameters
        • Code Generator
        • Sample Applications
        • Deployment
        • Upgrade Guide
      • Examples
      • Library Reference
        • Introduction
        • API
        • Auth
        • Basemaps
        • Core
        • Redux
        • UI
        • Widgets
      • Release Notes
  • CARTO Self-Hosted
    • Overview
    • Key concepts
      • Architecture
      • Deployment requirements
    • Quickstarts
      • Single VM deployment (Kots)
      • Orchestrated container deployment (Kots)
      • Advanced Orchestrated container deployment (Helm)
    • Guides
      • Guides (Kots)
        • Configure your own buckets
        • Configure an external in-memory cache
        • Enable Google Basemaps
        • Enable the CARTO Data Warehouse
        • Configure an external proxy
        • Enable BigQuery OAuth connections
        • Configure Single Sign-On (SSO)
        • Use Workload Identity in GCP
        • High availability configuration for CARTO Self-hosted
        • Configure your custom service account
      • Guides (Helm)
        • Configure your own buckets (Helm)
        • Configure an external in-memory cache (Helm)
        • Enable Google Basemaps (Helm)
        • Enable the CARTO Data Warehouse (Helm)
        • Configure an external proxy (Helm)
        • Enable BigQuery OAuth connections (Helm)
        • Configure Single Sign-On (SSO) (Helm)
        • Use Workload Identity in GCP (Helm)
        • Use EKS Pod Identity in AWS (Helm)
        • Enable Redshift imports (Helm)
        • Migrating CARTO Self-hosted installation to an external database (Helm)
        • Advanced customizations (Helm)
        • Configure your custom service account (Helm)
    • Maintenance
      • Maintenance (Kots)
        • Updates
        • Backups
        • Uninstall
        • Rotating keys
        • Monitoring
        • Change the Admin Console password
      • Maintenance (Helm)
        • Monitoring (Helm)
        • Rotating keys (Helm)
        • Uninstall (Helm)
        • Backups (Helm)
        • Updates (Helm)
    • Support
      • Get debug information for Support (Kots)
      • Get debug information for Support (Helm)
    • CARTO Self-hosted Legacy
      • Key concepts
        • Architecture
        • Deployment requirements
      • Quickstarts
        • Single VM deployment (docker-compose)
      • Guides
        • Configure your own buckets
        • Configure an external in-memory cache
        • Enable Google Basemaps
        • Enable the CARTO Data Warehouse
        • Configure an external proxy
        • Enable BigQuery OAuth connections
        • Configure Single Sign-On (SSO)
        • Enable Redshift imports
        • Configure your custom service account
        • Advanced customizations
        • Migrating CARTO Self-Hosted installation to an external database
      • Maintenance
        • Updates
        • Backups
        • Uninstall
        • Rotating keys
        • Monitoring
      • Support
    • Release Notes
  • CARTO Native App for Snowflake Containers
    • Deploying CARTO using Snowflake Container Services
  • Get Help
    • Legal & Compliance
    • Previous libraries and components
    • Migrating your content to the new CARTO platform
Powered by GitBook
On this page
  • Cluster Time Series
  • Composite Score Supervised
  • Composite Score Unsupervised
  • Cronbach Alpha Coefficient
  • Detect Space-time Anomalies
  • Detect Spatial Anomalies
  • Getis Ord
  • Getis Ord Spacetime
  • GWR
  • Hotspot Analysis
  • Local Moran's I
  • Moran's I
  • Spacetime Hotspots Classification

Was this helpful?

Export as PDF
  1. CARTO User Manual
  2. Workflows
  3. Components

Statistics

PreviousSpatial OperationsNextTileset Creation

Last updated 3 months ago

Was this helpful?

Components to perform spatial statistics calculations.

These components require installed in the chosen connection to build the workflow.

Cluster Time Series

Description

This component computes a clusterization based on the values of a feature over time (i.e. sales of different stores, temperature in different H3 cells, etc) and assigns a cluster number to each, using different methods to evaluate the values and their distribution over time.

Inputs

  • Input table: This component expects a table containing a column with at least a timestamp column, a value column and an index to be used as partition (usually an spatial index or a feature id)

Settings

  • Timestamp column: Select a column in the input that contains timestamps/dates.

  • Partition column: Each unique value on this column will be assigned a cluster.

  • Value column: Clusters will be calculated based on the value of this column along time

  • Number of clusters: Define the number of clusters to be generated.

  • Method:

    • Value

    • Profile

Outputs

  • Output table: This component will generate a table with the following columns:

    • Partitioning column with the same name as the given column from inputs.

    • Cluster (String): Contains the assigned cluster number.

Composite Score Supervised

Description

This component derives a spatial composite score as the residuals of a regression model which is used to detect areas of under- and over-prediction. The response variable should be measurable and correlated with the set of variables defining the score. For each data point, the residual is defined as the observed value minus the predicted value. Rows with a NULL value in any of the input variables are dropped.

Inputs

  • Input table

Settings

  • Column with unique geographic identifier (spatial index)

  • Input variables

  • Response variables

  • Model regressor

  • Model options

  • Output formatting

  • R-squared threshold

Outputs

  • Output table

External links

Composite Score Unsupervised

Description

This component combines (spatial) input variables into a meaningful composite score. The composite score can be derived using different methods, scaling and aggregation functions and weights. Rows with a NULL value in any of the model predictors are dropped.

Inputs

  • Input table

Settings

  • Column with unique geographic identifier (spatial index)

  • Input variables

  • Scoring method

    • Entropy

    • Custom Weights:

      • Variables and weights:When defining custom weights, you will find a button that controls the direction of the weight:

        • When specifying custom weights, the component will internally apply a normalization so that the sum of all weights is equal to 1

        • Scaling function:

          • MIN_MAX_SCALER: data is rescaled into the range [0,1] based on minimum and maximum values. Only numerical variables are allowed.

          • STANDARD_SCALER: data is rescaled by subtracting the mean value and dividing the result by the standard deviation. Only numerical variables are allowed.

          • RANKING: data is replaced by its percent rank, that is by values ranging from 0 lowest to 1. Both numerical and ordinal variables are allowed (categorical and boolean variables need to be transformed to ordinal).

          • DISTANCE_TO_TARGET_MIN(_MAX,_AVG):data is rescaled by dividing by the minimum, maximum, or mean of all the values. Only numerical variables are allowed.

          • PROPORTION: data is rescaled by dividing by the sum total of all the values. Only numerical variables are allowed.

        • Aggregation Function:

          • LINEAR: the spatial composite is derived as the weighted sum of the scaled individual variables.

          • GEOMETRIC: the spatial composite is given by the product of the scaled individual variables, each to the power of its weight.

      • First PC

        • Correlation variable: the spatial score will be positively correlated with the selected variable (i.e. the sign the spatial score is set such that the correlation between the selected variable and the first principal component score is positive).

        • Correlation threshold: the minimum absolute value of the correlation between each individual variable and the first principal component score.

  • Output formatting

    • None

    • Equal Intervals

    • Jenks

    • Quantiles

    • Return Range

Outputs

  • Result table

  • Lookup table:only available if output formatting is Equal Intervals, Jenks, or Quantiles

External links

Cronbach Alpha Coefficient

Description

This component computes the Cronbach Alpha Coefficient, which can be used to measure internal consistency of different variables.

This component can be used to determine wether a combination of variables are consistent enough to be used to create a Composite Score using one of the components above.

Inputs

  • Input table

Settings

  • Input variables

Outputs

  • Output table

External links

Detect Space-time Anomalies

Description

Inputs

  • Source table [Table]: The source table. It must contain the index_column, the date_column, the value_column, a column named value_column_baseline with the values that should be used as a baseline to detect the anomalies and, when the distributional model parameter is set to ‘GAUSSIAN’ also its variance, value_column_baseline_sigma2.

Settings

  • Index column [Column]: The name of the column with the unique geographic identifier of the spatial index, either ‘H3’ or ‘QUADBIN’.

  • Date column [Column]: The name of the column with the date identifier.

  • Value column [Column]: The name of the column with the variable for which the space-time anomalies should be detected.

  • Time frequency [Selection]: The temporal frequency of the data selected from one of the following: ‘SECOND’, ‘MINUTE’, ‘HOUR’, ‘DAY’, ‘WEEK’, ‘MONTH’, ‘QUARTER’, ‘YEAR’.

  • Distributional model [Selection]: The distributional model of the data, either ‘POISSON’ or ‘GAUSSIAN’.

  • Estimation method [Selection]: The estimation method, either ‘EXPECTATION’ or ‘POPULATION’ for the expectation- and population-based methods respectively.

  • Prospective analysis [Boolean]: Option to specify if the analysis is retrospective or prospective. In a prospective analysis, only temporal zones that end with the last timestamp are considered and the interest lies in detecting new emerging anomalies. In a retrospective analysis instead, the space-time anomalies can happen at any point in time over all the past data (a temporal zone can end at any timestamp)

  • High-mean anomalies [Boolean]: Option to specify if the analysis is for detecting space-time zones higher or lower than the baseline.

  • K-ring size (advanced, optional) [Range]: The minimum and maximum k-ring size used to define the spatial zones.

  • Temporal window (advanced) [Range]: The minimum and maximum temporal window used to define the temporal zones.

  • Number of permutations (advanced) [Number]: The number of permutations used to derive the random replicas to test the anomaly statistical significance

  • Max results (advanced) [Number]: The maximum number of space-time zones returned

Outputs

  • Output table [Table]: The output table

External links

Detect Spatial Anomalies

Inputs

  • Source table [Table]: The source table. It must contain the index_column, the date_column, the value_column, a column named value_column_baseline with the values that should be used as a baseline to detect the anomalies and, when the distributional model parameter is set to ‘GAUSSIAN’ also its variance, value_column_baseline_sigma2.

Settings

  • Index column [Column]: The name of the column with the unique geographic identifier of the spatial index, either ‘H3’ or ‘QUADBIN’.

  • Value column [Column]: The name of the column with the variable for which the spatial anomalies should be detected.

  • Distributional model [Selection]: The distributional model of the data, either ‘POISSON’ or ‘GAUSSIAN’.

  • Estimation method [Selection]: The estimation method, either ‘EXPECTATION’ or ‘POPULATION’ for the expectation- and population-based methods respectively.

  • High-mean anomalies [Boolean]: Option to specify if the analysis is for detecting space-time zones higher or lower than the baseline.

  • K-ring size (advanced, optional) [Range]: The minimum and maximum k-ring size used to define the spatial zones.

  • Number of permutations (advanced) [Number]: The number of permutations used to derive the random replicas to test the anomaly statistical significance

  • Max results (advanced) [Number]: The maximum number of space-time zones returned

Outputs

  • Output table [Table]: The output table

External links

Getis Ord

Description

This component computes the Getis-Ord Gi* statistic for each spatial index in the source table.

Inputs

  • Source table [Table]

Settings

  • Index column [Column]

Make sure that the input data for this component doesn't contain any NULL values on the index column. Otherwise the execution will fail.

  • Value column [Column]

  • Kernel function for spatial weights [Selection]

  • Size [Number]

Outputs

  • Result table [Table]

External links

Getis Ord Spacetime

Description

This component computes the spacetime Getis-Ord Gi* statistic for each spatial index and datetime timestamp in the source table.

Inputs

  • Source table [Table]

  • Index column [Column]

Make sure that the input data for this component doesn't contain any NULL values on the index column. Otherwise the execution will fail.

  • Date column [Column]

  • Value column [Column]

  • Kernel function for spatial weights [Selection]

  • Kernel function for temporal weights [Selection]

  • Size [Number]

  • Temporal bandwidth [Number]

  • Time interval [Selection]

Outputs

  • Result table [Table]

External links

GWR

Description

This component runs a spatial-index-based Geographically Weighted Regression (GWR) model.

Inputs

  • Input table [Table]

  • Index column [Column]

  • Feature variables [Column]

  • Target variable [Column]

  • K-ring size [Number]

  • Kernel function [Selection]

  • Fit intercept [Boolean]

Outputs

  • Result table [Table]

External links

Hotspot Analysis

Description

This component is used to locate hotspot areas by calculating a combined Getis-Ord Gi* statistic using a uniform kernel over one or several variables.

Inputs

  • Source table [Table]

  • Index column [Column]

  • Input variables [Column]

  • Variable weights [String]

  • K-ring size [Number]

  • Significance level [Number]

Outputs

  • Result table [Table]

External links

Local Moran's I

Description

This component computes the local Moran's I statistic for each spatial index in the source table.

Inputs

  • Source table [Table]

  • Index column [Column]

  • Value column [Column]

  • Size [Number]

  • Decay function [Selection]

  • Permutations [Number]

Outputs

  • Result table [Table]

External links

Moran's I

Description

This component computes the Moran's I statistic for each spatial index in the source table.

Inputs

  • Source table [Table]

  • Index column [Column]

  • Value column [Column]

  • Size [Number]

  • Decay function [Selection]

Outputs

  • Result table [Table]

External links

Spacetime Hotspots Classification

Description

Inputs

Settings

  • Index column: Select a column in the input that contains a spatial index (Quadbin or H3).

  • Date column: Select a column in the input that contains date/timestamp.

  • Gi Value: Select a column in the input that contains a Gi value generated by a Getis Ord Spacetime component.

  • P Value: Select a column in the input that contains a P value generated by a Getis Ord Spacetime component.

  • Threshold: Select the threshold of the P value for a location to be considered as hotspot/coldspot.

  • Algorithm: Select the algorithm to be used for the monotonic trend test:

    • Mann-Kendall (default)

    • Modified Mann-Kendall

Output

  • Output table: The component will return a table with the column that contains the index and the resulting columns from the procedure's execution.

By default, it is set as "Direct" (), which means that the variable's value will weight in to make the final score higher.

If set to "Inverse" (), the variable's value will weight in to make the final score lower.

This component can be used to detect space-time regions where the variable of interest is higher (or lower) than its baseline value. It implements the developed in to detect space-time regions where the variable of interest is higher (or lower) than its baseline value.

This component can be used to detect spatial regions where the variable of interest is higher (or lower) than its baseline value. It implements the developed in to detect spatial regions where the variable of interest is higher (or lower) than its baseline value.

This component requires installed in the chosen connection to build the workflow.

This component takes the output of and classifies each location into specific types of hotspots or coldspots, based on patterns of spatial clustering and intensity trends over time.

Input table: This component expects to be connected to the output of a successfully executed node, or a table that contains that same result.

the CARTO Analytics Toolbox
scan statistics framework
this R package
BigQuery reference
scan statistics framework
this R package
BigQuery reference
BigQuery reference to H3 procedure
BigQuery reference to Quadbin procedure
BigQuery reference to H3 procedure
BigQuery reference to Quadbin procedure
BigQuery reference
the CARTO Analytics Toolbox
BigQuery reference
BigQuery reference to H3 procedure
BigQuery reference to Quadbin procedure
BigQuery reference to H3 procedure
BigQuery reference to Quadbin procedure
Getis Ord Spacetime
Getis Ord Spacetime
Snowflake reference
Snowflake reference
Snowflake reference
BigQuery reference
BigQuery reference
BigQuery reference