I'm a PhD candidate in the Department of Computer Science at the University of Maryland, College Park; co-advised by Amol Deshpande and Aravind Srinivasan. I also hold a Masters degree in Computer Science (2015) from University of Maryland, College Park, and a Bachelors degree in Computer Engineering (2011) from Veermata Jijabai Technological Institute, India.
I am interested in data management and data-intensive computing, with a particular focus on storage and query processing challenges in a dataset version control system. During the past years, I have been working on developing a platform to simplify and automate fundamental book-keeping operations in data science/analytics worflows, e.g., data collaboration and versioning, data provenance, and in-situ integration and search.
DEX: Query Execution in a Delta-based Storage System.
Amit Chavan, Amol Deshpande. SIGMOD Int'l Conf. on Management of Data, 2017.
ProvDB: A System for Lifecycle Management of Collaborative Analysis Workflows.
Hui Miao, Amit Chavan, Amol Deshpande. CoRR Technical Report arXiv:1610.04963, 2016.
Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff.
Souvik Bhattacherjee, Amit Chavan, Silu Huang, Amol Deshpande, and Aditya Parameswaran. 41st International Conference on Very Large Data Bases (VLDB), Kohala Coast, Hawaii, USA. September 2015. [Slides]
Improved Bounds in Stochastic Matching and Optimization.
Alok Baveja, Amit Chavan, Andrei Nikiforov, Aravind Srinivasan, and Pan Xu. 18th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems (APPROX), New Jersey, USA. August 2015.
Towards a unified query language for provenance and versioning.
Amit Chavan, Silu Huang, Amol Deshpande, Aaron Elmore, Samuel Madden, and Aditya Parameswaran. 7th International Workshop on Theory and Practice of Provenance (TaPP), Edinburgh, Scotland. July 2015. [Slides]
DataHub: Collaborative Data Science and Dataset Version Management at Scale.
Anant Bhardwaj, Souvik Bhattacherjee, Amit Chavan, Amol Deshpande, Aaron J. Elmore, Samuel Madden, and Aditya Parameswaran. 7th Biennial Conference on Innovative Database Research (CIDR), Asilomar, USA. January 2015.
3220, A. V. Williams Building,
Department of Computer Science,
University of Maryland, College Park.