I am a Senior Staff Research Scientist at Google’s Cambridge office (Massachusetts), working on multiple topics in machine learning: generative models (images and text/LLM’s), multimodal representation learning, computer vision and large language models. My research has been deployed in multiple Google PA’s such as Ads, YouTube and Cloud. From August 2013 to November 2014, I was a Postdoctoral Associate with Bill Freeman at MIT’s CSAIL Lab. In June 2013, I received my PhD from the Computer Science department at New York University , under the supervision of Rob Fergus. I was awarded a Microsoft Research PhD Fellowship for 2010-2011, a Dean’s Dissertation Fellowship for 2012-2013 from the Graduate School of Arts and Sciences and the Janet Fabri prize (2013-2014) for outstanding dissertation in Computer Science. I am also an active angel investor and startup advisor.

Email: dilipkay@gmail.com.

CV

Google Scholar

Publications and Products:

Patents (USPTO): List of issued and applied patents here.

Entrepreneurship/Angel Investing/Advising:

  • Angel investor/advisor: helm.ai, jabali.ai, AgShift, Piction Health, Folia Health, and many others.
  • Co-founder and CTO of Nirvana Digital, later acquired by Black Magic Designs. Nirvana Digital developed Revival, a film and video restoration system, now used in post-production companies worldwide, and Resolve, a world-leading solution that combines editing, color correction, visual effects, motion graphics and audio post production in one tool. These products are installed on millions of laptops worldwide. Resolve was featured in Apple’s 2022 Keynote.

Professional Activities: Reviewer/Area Chair for the following conferences and journals:

  • NeurIPS, ICML, ICLR, ICCV, CVPR, ECCV etc.

Talks (not updated recently):

  • Efficient Preconditioning for Laplacian Matrices. Microsoft Research, Cambridge, England, January 2014.
  • Invited Talk – Fast Image Deconvolution Using Hyper Laplacian Priors. Group Meeting of Fredo Durand’s graphics group, MIT (CSAIL), October 22, 2010.
  • Invited Talk – Dark Flash Photography and Fast Image Deconvolution Using Hyper Laplacian Priors. Rick Szeliski’s Interactive Visual Media Group, Microsoft Research, Seattle, December 3, 2009.
  • Dark Flash Photography. SIGGRAPH 2009, New Orleans, August 7, 2009.
  • Dark Flash Photography. Group meeting of Laboratory for Computational Vision, NYU, May 28, 2009.
  • Dark Flash Photography. Group meeting of Bill Freeman’s Vision Group, MIT (CSAIL), May 19, 2009.
  • Dark Flash Photography. Graphics Seminar, NYU, May 8, 2009.