|
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
| Volume 187 - Issue 81 |
| Published: February 2026 |
| Authors: Shailendra Singh Kathait, Ashish Kumar, Samay Sawal |
10.5120/ijca2026926403
|
Shailendra Singh Kathait, Ashish Kumar, Samay Sawal . An NLP-Driven Intelligent Video Query System for Interactive Video Retrieval. International Journal of Computer Applications. 187, 81 (February 2026), 1-6. DOI=10.5120/ijca2026926403
@article{ 10.5120/ijca2026926403,
author = { Shailendra Singh Kathait,Ashish Kumar,Samay Sawal },
title = { An NLP-Driven Intelligent Video Query System for Interactive Video Retrieval },
journal = { International Journal of Computer Applications },
year = { 2026 },
volume = { 187 },
number = { 81 },
pages = { 1-6 },
doi = { 10.5120/ijca2026926403 },
publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2026
%A Shailendra Singh Kathait
%A Ashish Kumar
%A Samay Sawal
%T An NLP-Driven Intelligent Video Query System for Interactive Video Retrieval%T
%J International Journal of Computer Applications
%V 187
%N 81
%P 1-6
%R 10.5120/ijca2026926403
%I Foundation of Computer Science (FCS), NY, USA
Efficient analysis of large-scale urban surveillance video remains a critical challenge for traffic management authorities. This paper introduces a unified video query system that enables naturallanguage– driven retrieval of traffic violation events from continuous CCTV feeds. This approach builds on state-of-the-art deeplearning detectors for a diverse set of infractions—including helmet non-compliance and cycle-lane misuse, illegal parking, overspeeding and wrong-way driving, and pedestrian tracking via facial recognition and augments them with fine-grained attribute extraction (vehicle type, color, carrying objects, timestamp, and spatial region). Detected events are stored in a multi-attribute database that supports compound filters. An integrated large language model (LLM) translates free-form user queries into structured query specifications (e.g., “Show me all red motorcycles speeding above 60 km/h between 6 AM and 8 AM”), automatically resolving synonyms, time-range interpretations, and attribute mappings. Retrieved results are presented as ranked frame sequences, complete with annotated bounding boxes and metadata, and can be reviewed via an interactive dashboard. This system demonstrates that natural-language–based video querying, when tightly coupled with a rich, structured violation index, can dramatically accelerate incident investigation and support data-driven traffic enforcement.