The use of multi-modal models and machine learning tech-niques to improve the efficiency and accuracy of geospatial data analysis

Matthew C Gaskins

The use of multi-modal models and machine learning tech-niques to improve the efficiency and accuracy of geospatial data analysis

Matthew C Gaskins

Published: 06 May 2025, Last Modified: 29 May 2025VLMs4All 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Large language models, retrieval-augmented generation, computer vision, geospatial analysis, automated imagery processing, contextual data enrichment.

TL;DR: This paper introduces a hybrid geospatial analysis workflow that uses computer vision and generative AI to automate satellite imagery interpretation, improving speed, accuracy, and contextual understanding.

Abstract: Abstract: Geospatial data analysis is heavily reliant on human interpretation of large-scale imagery which leads to constraints in scalability. This study evaluates whether mul-ti-modal models can assist in overhead image understanding by accurately interpreting imagery and automating workflows. A hybrid machine learning solution using Over-sightML (OSML)—an open-source, cloud-based framework—is assessed for its ability to improve geospatial workflows. OSML integrates state-of-the-art computer vision with generative AI capabilities and streamlines preprocessing and detection aggregation. Results indicate that combining domain-specific CV models with foundation models offers a scalable and efficient alternative to manual analysis workflows

Submission Number: 9

Loading