Skip to main content

GeoMesa

GeoMesa is an open source, distributed spatio-temporal indexing and analytics framework for storing, querying, and processing geospatial data at scale on big data platforms.

  • Distributed spatio-temporal indexing and query engine for geospatial data (geospatial data management)
  • Integration with distributed data stores such as Apache Accumulo, Apache HBase, Apache Cassandra, and Google Bigtable (database integration)
  • Support for GeoTools, GeoServer, and OGC-compliant spatial queries and services (geospatial standards and interoperability)
  • Streaming and batch ingestion of large geospatial data sets, including sensor and event data (data ingestion and Extract, Transform, Load (ETL))
  • Spatial-temporal analytics and query support over big data ecosystems, including Apache Hadoop and Apache Spark (big data analytics)

More About geomesa.org

GeoMesa is an open source project under the Eclipse Foundation that focuses on distributed storage, indexing, and querying of spatio-temporal data (geospatial data management) on top of widely used big data infrastructure. It addresses use cases where organizations need to handle large volumes of geospatial records, such as points, polygons, and trajectories, with associated timestamps, and query them with low latency in a distributed environment. The project is designed to work with clustered data stores so that geospatial workloads can be executed close to the data.

The core capability of GeoMesa is its spatio-temporal indexing layer (indexing engine), which encodes latitude, longitude, time, and other attributes into index structures that can be stored in distributed key-value or wide-column databases. This enables queries such as bounding box searches, time range filters, attribute predicates, and combinations of these, executed in a distributed manner. GeoMesa exposes these capabilities through APIs and through integration with GeoTools (geospatial toolkit), which allows applications to work with standard geospatial data models and query constructs.

GeoMesa provides pluggable backends for multiple distributed data stores (database integration), including Apache Accumulo, Apache HBase, Apache Cassandra, and Google Cloud Bigtable, as described in its official materials. It also works with filesystems and object stores in Hadoop ecosystems. This design allows enterprises to deploy GeoMesa on infrastructure they already operate, while maintaining a common geospatial query and data model. The system supports command-line tools and ingestion utilities that load geospatial data from various formats into the underlying stores, as well as converters that map input fields into GeoMesa schemas.

The project integrates with GeoServer (geospatial services) through GeoMesa data store plugins. This enables Open Geospatial Consortium (OGC) standard services such as WFS, Workload Management System (WMS), and Workload Placement Strategy (WPS) (geospatial standards) to operate over data held in distributed databases through GeoMesa. As a result, organizations can serve maps, vector features, and geospatial APIs backed by scalable storage, while clients interact using standard protocols. GeoMesa also supports common geospatial formats such as Shapefile, GeoJSON, and others via GeoTools and associated converters.

For analytics workloads, GeoMesa connects with Apache Spark (big data analytics) by exposing data stored in GeoMesa-backed tables as Spark RDDs or DataFrames, allowing spatial-temporal analysis using Spark’s processing engine. This is used for tasks such as clustering, filtering, joins, and aggregations on large geospatial datasets. GeoMesa can also ingest streaming data from systems like Kafka as documented in project materials, supporting near-real-time updates and queries for sensor and event streams.

In enterprise and institutional environments, GeoMesa is applied in domains such as situational awareness, asset tracking, environmental monitoring, and telecommunications (industry solutions), where there is a need to query and visualize large geospatial datasets across time. Its alignment with OGC standards, use of widely adopted big data platforms, and modular backend architecture position GeoMesa in the category of distributed geospatial data platforms and analytics frameworks (geospatial data platform). It serves as a bridge between traditional GIS tools and big data ecosystems by enabling standard geospatial workflows to operate on scalable storage and processing systems.