Publications

(2024). ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos. In ArXiv.

PDF Cite Code

(2024). RGNet: A Unified Retrieval and Grounding Network for Long Videos. In ECCV.

PDF Cite Code Project

(2024). Context Matters: Leveraging Spatiotemporal Metadata for Semi-Supervised Learning on Remote Sensing Images . In ECAI.

PDF Cite

(2023). GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation. In ICPR.

PDF Cite Code

(2022). InstanceFormer: An Online Video Instance Segmentation Framework. In AAAI23.

PDF Cite Code Poster

(2022). Box Supervised Video Segmentation Proposal Network. In IMVIP22.

PDF Cite Code Slides Video

(2022). COVID-DenseNet: A Deep Learning Architecture to Detect COVID-19 from Chest Radiology Images. In ICDSA22.

PDF Cite Code Slides

(2021). Prediction of Soft Proton Intensities in the Near-Earth Space Using Machine Learning. In ApJ21.

PDF Cite Code