Skip to content
View madhav1ag's full-sized avatar

Block or report madhav1ag

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
madhav1ag/README.md

Hi,

I am a Ph.D. student at IDCOM in the School of Engineering, University of Edinburgh. I am a member of Vision Group and VIOS, working under the supervision of Dr. Steven McDonagh and Dr. Laura Sevilla. My interest lies in Multimodal Learning, Spatial-Temporal Understanding in Foundation Models, and Generative AI.

Before moving to the UK, I spent a wonderful year in Germany working on building lip-syncing and synthetic media generation models. I also spent three months at Visual Computing & Artificial Intelligence group at Technical University of Munich with Prof. Matthias Nießner.

I completed MS by Research at CVIT, IIIT Hyderabad under the guidance of Prof. C.V. Jawahar and Prof. Vinay P. Namboodiri. My graduate research focused on Lip-Sync, Talking Head Generation, and Face Reenactment, along with their optimization for real-world problems. Additionally, I worked on the task of Table Detection in Document Images with high accuracy under the supervision of Prof. C.V. Jawahar and Dr. Ajoy Mondal. Prior to this, I worked as a Data Scientist and a team lead with several companies, broadly in the domains of Facial Recognition, Video Surveillance using AI, and Document Image Processing.

Google Scholar | LinkedIn | CV

GitHub Stats

mdv3101

Madhav's GitHub stats Top Langs

Pinned Loading

  1. CDeCNet CDeCNet Public

    CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images

    Python 133 33

  2. AVFR-Gan AVFR-Gan Public

    Audio-Visual Generative Adversarial Network for Face Reenactment

    158 10

  3. DeepSort_Yolo DeepSort_Yolo Public

    Real Time Person Tracking using DeepSort and Yolo_v4

    Jupyter Notebook 13 9

  4. Rethinking_Generalization Rethinking_Generalization Public

    SMAI Project: Understanding Deep Learning Requires Rethinking Generalization

    Jupyter Notebook 6 1

  5. Digital_Handwriting Digital_Handwriting Public

    Developing a system that will transform one handwriting to another pre-trained handwriting with the help of Long Short Term Memory (LSTM) Networks Technology Used : Deep Learning & Neural Networks,…

    Python 1

  6. Beer-Label-Classification-using-SIFT Beer-Label-Classification-using-SIFT Public

    This project is part of the course CSE478: Digital Image Processing, Monsoon 2020, IIIT-Hyderabad. It classify beer labels using SIFT algorithm.

    Jupyter Notebook 2