8000 GitHub - seshing/OpenFACADES: An Open Framework for Architectural Caption and Attribute Data Enrichment via Street View Imagery
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

seshing/OpenFACADES

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

79 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

text

Tutorial preview: [Google Colab]

OpenFACADES

An Open Framework for Architectural Caption and Attribute Data Enrichment via Street View Imagery

Overview

overview

To Do List

  • Release code for building data harmonization.
  • Release code for integrating building and street view imagery data.
  • Develop Google Colab tutorial for retriving building image data.
  • Develop Google Colab tutorial for building labeling and captioning.
  • Release fine-tuned model (1B, 2B).
  • Release global building dataset.
  • Release training data.
  • Integrate more SVI platforms into the framework.

Installation

To install OpenFACADES, follow these steps:

  1. Clone the repository:
git clone https://github.com/seshing/OpenFACADES.git
  1. Install the package and required dependencies:
pip install -e OpenFACADES/.
pip install -r OpenFACADES/requirements.txt

Note: The package used pytorch and torchvision, you may need to install them separately. Please refer to the official website for installation instructions.

What can our method do?

  1. Integrating multimodal crowdsourced data: acquire building data and street view imagery from crowdsourced platforms for selected areas, and conduct isovist analysis to integrate them.

  2. Retrieving building image data: perform object detection to identify target buildings in panoramic images and reproject them back to a holistic perspective view, with image filtering functions to select high-quality building images. detect

  3. Establishing dataset and multimodal models: apply state-of-the-art multimodal large language models to annotate building images with multiple attributes, including building type, surface material, number of floors, and building age, and provide detailed descriptive captions.

vlm
(a) building attributes labeling

vlm
(b) image captioning

Quick start

To acquire individual building images (Steps 1 & 2 above) for an area, you can simply run:

python OpenFACADES/run.py \
  --bbox=[left,bottom,right,top] \
  --api_key='YOUR_MAPILLARY_API_KEY'

Note: please check Mapillary has panoramic images available for the selected area.

Example bbox:
[8.552,47.372,8.554,47.376]: an area in Zurich, Switzerland;
[-81.382,28.540,-81.376,28.543]: an area in Orlando, the US;
[-70.660,-33.442,-70.655,-33.437]: an area in Santiago, Chile;
[-73.578,45.497,-73.569,45.502]: an area in Montreal, Canada;
[37.618,55.758,37.628,55.763]: an area in Moscow, Russia;
[25.273,54.684,25.283,54.687]: an area in Vilnius, Lithuania.

Output paths:
building footprint: output/01_data/footprint.geojson;
detected building images: output/02_img/individual_building;
building image ids after filtering: output/02_img/individual_building_select.csv.

Acknowledgement

We acknowledge the contributors of OpenStreetMap, Mapillary and other platforms for providing valuable open data resources and code that support street-level imagery research and applications.

About

An Open Framework for Architectural Caption and Attribute Data Enrichment via Street View Imagery

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0