Hi! 👋 I am currently pursuing an M.S. in Machine Learning at the University of Maryland, College Park, and am part of Prof. Tianyi Zhou’s research lab. My research broadly encompasses machine learning with specific interests in computer vision, natural language processing, multimodal techniques, and generative AI. I am particularly passionate about building intelligent agents that leverage reasoning capabilities to effectively perform vision-related tasks and decision-making processes. Previously, I earned my B.Tech degree in Computer Science from Rajiv Gandhi Institute of Petroleum Technology where I researched on applying computer vision techniques in healthcare, construction and urban analytics.

I am actively seeking internship opportunities for Summer 2025 to further advance these research interests. I am open to collaborating—if my research interests and work resonate with you, feel free to reach out to discuss potential opportunities!

🔥 News

📝 Publications

Preprint

FaSTA*: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing [Project]

Advait Gupta, Rishie Raj, Dang Nguyen, Tianyi Zhou

FaSTA* is a neurosymbolic online learning, tool-use agent with fast-slow planning for complex multi-turn image editing tasks. It decomposes a task into subtasks and calls a sequence of AI tools to address each subtask. By learning a library of frequently used subroutines (subsequences of tools), it can rely on fast planning for most subtasks, and occasionally, lazily activate slow planning (which requires A* search) for rare and challenging subtasks that the learned library of subroutines cannot handle.

Preprint

CoSTA*: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing [Project]

Advait Gupta, NandaKiran Velaga, Dang Nguyen, Tianyi Zhou

CoSTA* is a hybrid agent for multi-turn image editing that combines LLM-based reasoning with A* search for cost-efficient tool selection, balancing cost and quality. Unlike text-to-image models like Stable Diffusion and DALLE-3, which struggle with complex edits and retaining input image details, and agents like GenArtist, CLOVA, and VisProg, which perform poorly on multimodal, multi-step tasks, CoSTA* constructs an optimal toolpath using LLM-guided hierarchical planning and A* search. It dynamically adapts by refining tool effectiveness through real-time feedback from a VLM, ensuring robust and efficient execution.

SNPD 2023

From Above and Beyond: Decoding Urban Aesthetics with the Visual Pollution Index

Advait Gupta, Manan Padsala, Devesh Jani, Tanmay Bisen, Aastha Shayla, Gargi Srivastava

It is among the top papers to be accepted for publishing in Springer’s Studies in Computational Intelligence.

ICMLA 2023

From Sky to Strategy: Construction Activity Index and Stage Estimation From Drone-Captured Imagery

Advait Gupta, Manan Padsala, Aastha Shayla, Tanmay Bisen, Susham Biswas, Abhemanyu Sarin

INCET 2023

🚀 Personal Projects

  • Emotion-Based Poem Generation with GPT-2 đź“… 2023.07
    • The Emotion-Based Poem Generation project fine-tunes GPT-2 to create poems that reflect user-specified emotions. Using a curated dataset labeled with emotions from the NRC Emotion Lexicon, the model generates text that aligns with both style and sentiment. By leveraging emotion prompts, it ensures the output resonates with the intended tone while allowing users to customize poem length.
  • Billboard Advertisement Recommendation System đź“… 2023.06
    • The Billboard Advertisement Recommendation System is a dynamic platform that selects ads based on real-time traffic and pedestrian demographics. Using computer vision, machine learning, and content-based filtering, it analyzes the audience to display relevant ads.

🎖 Honors and Awards

  • 2023: Qualified for the Grand Finale of Smart India Hackathon 2023 organized by Government of India.
  • 2022: Among 40 students selected nationwide for the ACM Winter School on Optimization for ML and Operations Research.
  • 2020: Ranked among top 2% students in the Joint Entrance Examination - India.

đź“– Educations

  • 2024.08 - 2026.05 (Expected), MS in Machine Learning, University of Maryland- College Park.
  • 2020.12 - 2024.05, Bachelor of Technology in Computer Science, Rajiv Gandhi Institute of Petroleum Technology.

đź’» Internships

  • 2024.01 - 2024.03, Machine Learning Intern, Techpeek, India.
  • 2023.08 - 2023.05, Summer Intern, Ernst & Young, India.
  • 2023.01 - 2023.03, Machine Learning Intern, Spritle Software, India.
  • 2022.06 - 2022.11, Machine Learning Intern, AIEnsured (testAIng.com), India.