Classifying Human Activities in Urban Spaces with a Multimodal AI: Towards a Massive Assessment of Urban Affordances

Blecic I.;Floris A.;Giliberto G.;Trunfio G. A.
2026-01-01

Abstract

We present a tool that leverages a Multimodal Large Language Model (MLLM) for the automatic classification of human activities from images of urban scenes. Starting from an image of spaces populated with people, the tool is capable to classify them according to five features: age group, sex, bodily posture, activity level and social configuration. The tool implements a sequential pipeline consisting of Faster R-CNN for person detection, followed by postprocessing and two consecutive applications of GPT-4o models for refined image description and information extraction. In the paper we also present an experimental test used to preliminary validation of the tool, comparing the ground truth on 24 images of urban scenes with the estimates provided by the tool, yielding a good degree of alignment. The tool is part of the wider research programme of massive assessment of urban affordances, within the framework of the capability approach.
2026
Inglese
Lecture Notes in Computer Science
9783031976063
Springer Science and Business Media Deutschland GmbH
15890
341
357
17
Workshops of the International Conference on Computational Science and Its Applications, ICCSA 2025
Comitato scientifico
2025
tur
scientifica
capability approach
Multimodal Large Language Model
urban affordances
urban evaluation modelling
urban space analysis
no
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
Blecic, I.; Floris, A.; Giliberto, G.; Trunfio, G. A.
273
4
4.1 Contributo in Atti di convegno
embargoed_20260629
info:eu-repo/semantics/conferencePaper
Files in This Item:
File Size Format  
IRIS_Paper ICCSA2025 final.pdf

embargo until 29/06/2026

Type: Author’s Accepted Manuscript AAM, Post-print, (version accepted by the publisher)
Size 2 MB
Format Adobe PDF
2 MB Adobe PDF & nbsp; View / Open   Request a copy

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie