Overview
A tremendous amount of data is generated every day that needs to be stored, so that it can be easily accessed by people interested in information about certain topics, movie videos, or data for scientic research etc. Advances in almost every eld are now possible due to the availability of data for research, whether it be data about weather, DNA sequences, sales information data for inventory management and marketing, or data collected by satellites.
Most of this data is stored on disk arrays that are located in a centralized data center. These data centers are packing together racks and racks of equipment and are typically located where power is cheap since they use a tremendous amount of energy. In this project we focus on storage centric data centers containing thousands of distributed disks which store large quantities of data. Just as regular datacenters, thermal management in such large scale storage systems is a major problem. Heavy workload processing causes the disks to heat up leading to loss of reliability. This signicantly impacts cooling cost as well. In this project we are investigating techniques for thermal management in large scale storage systems comprising of thousands of storage devices and processing millions of data requests per day. Although a signicant amount of work exists for thermal and energy management in datacenters, temperature considerations in large scale storage systems have received little attention.
We focus on thermal and energy management questions by considering both how workload affects cooling costs and how temperature rise affects reliability. The main objective is to develop models and algorithms to optimize workload to minimize energy usage with a focus being to manage temperature spikes by adjusting workload distribution. In addition, we are interested in techniques that will permit a shut down or slow down of a signicant fraction of the system in times of lower than peak demand for data.
Applications
Disk access and usage patterns vary quite a bit across different applications. Our goal is to develop a general science of thermal management, that will guide us in data placement, data migration, data replication, data access task distribution, scheduling and control of disk speeds, for any large array of storage devices. Due to fluctuating workload, there is a tremendous potential to save energy and reduce thermal hotspots by shutting down or slowing down a signicant fraction of the disk system. In addition, by balancing workloads, we may be able to eliminate hot spots. The following applications would serve as a guide while developing such a thermal management scheme.
Streaming Video
In recent years companies such Netflix have made streaming video a reality, and the demand for this is expected to grow many-fold over the next few years. The typical data access pattern here may be composed of very large sequential reads, that each occupy a single disk for a period of time (maybe a few hours).
Large-scale Data Analysis
We consider two fairly similar application domains here: (1) Very large databases (e.g. scientic federations like Skyquery, genome databases) where the typical access pattern includes a set of user queries each of which may involve accessing a set of relations; (2) Large scale analytics made popular in recent years by the map-reduce framework, where the typical access pattern is similar (each user query accesses a set of files), with a major exception being that replication is typically built into the framework for performance and fault tolerance.
Transaction Processing
Here we may have a large database that needs to support possibly hundreds of thousands of transactions per second. The key difference from the above two application domains is that, the data accesses here are for small ??records?? and typically random. This also captures the behavior of many online Web services or websites like de.li.cious, Facebook etc. These applications span a range of access patterns, and we will use them to make our optimization problems concrete.
Publications
-
LP Rounding and Combinatorial Algorithms for Active Time and Busy Time Scheduling
J. Chang, S. Khuller and K. Mukherjee
Submitted (2013). -
Optimal Batch Schedules for parallel machines
F. Koehler and S. Khuller. 2013.
In Proceedings of the Workshop on Algorithms and Data Structures. -
Algorithms for the Thermal Scheduling Problem
K. Mukherjee, S. Khuller and A. Deshpande. 2013.
In Proceedings of the IPDPS Conference. -
A Min-Edge Cost Flow Framework for Capacitated Covering Problems
J. Chang and S. Khuller. 2013.
In Proceedings of the ALENEX Conference. -
To send or not to send: Reducing the cost of data transmission
L. Golubchik, S. Khuller, K. Mukherjee and Y. Yao. 2013.
In Proceedings of the INFOCOM Conference. -
SWORD: Scalable Workload-Aware Data Placement for Transactional Workloads;
A. Quamar, K. Ashwin Kumar, A. Deshpande. 2013.
Proceedings of the 16th International Conference on Extending Database Technology (EDBT), 2013 -
Data Placement and Replica Selection for Improving Co-location in Distributed Environments
K. Ashwin Kumar, A. Deshpande, S. Khuller. 2013.
CoRR Technical Report arXiv:1302.4168, 2013 -
Hone: "Scaling Down" Hadoop on Shared-Memory Systems
K. Ashwin Kumar, Jonathan Gluck, Amol Deshpande, Jimmy Lin. 2013.
VLDB Demo 2013. -
Set Cover revisited: Hypergraph Cover with Hard Capacities
B.Saha and S. Khuller. 2012.
In Proceedings of the 39th International Colloquium on Automata, Languages and Programming (ICALP '12). -
A model for minimizing active processor time
J. Chang, H. Gabow and S. Khuller. 2012.
In Proceedings of the 20th Annual European Symposium on Algorithms (ESA '12). -
New Approximation Results for Resource Replication Problems.
S. Khuller, B. Saha and K. K. Sarpatwar. 2012.
In Proceedings of The 15th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems. -
Saving on Cooling: The Thermal Scheduling Problem
K. Mukherjee, S. Khuller, A. Deshpande. 2012.
SIGMETRICS 2012 (poster) -
Generalized machine activation problems
Jian Li and Samir Khuller. 2011.
In Proceedings of the Twenty-Second Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '11). SIAM 80-94. -
Energy efficient scheduling via partial shutdown
Samir Khuller, Jian Li, and Barna Saha. 2010.
In Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '10). Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 1360-1372. -
Resource-aware architectures for particle filter based visual target tracking
D. Forte, A. Srivastava. 2011.
Green Computing Conference and Workshops (IGCC), 2011 International , vol., no., pp.1-6, 25-28 July 2011 -
Energy-aware video storage and retrieval in server environments
D. Forte, A. Srivastava. 2011.
Green Computing Conference and Workshops (IGCC), 2011 International , vol., no., pp.1-6, 25-28 July 2011 -
Adaptable architectures for distributed visual target tracking
D. Forte, A. Srivastava. 2011.
Computer Design (ICCD), 2011 IEEE 29th International Conference on , vol., no., pp.339-345, 9-12 Oct. 2011 -
Energy-aware and quality-scalable data placement and retrieval for disks in video server environments
D. Forte, A. Srivastava. 2011.
Computer Design (ICCD), 2011 IEEE 29th International Conference on , vol., no., pp.457-458, 9-12 Oct. 2011 -
Thermal and Power-Aware Task Scheduling for Hadoop Based Storage Centric Datacenters
B. Shi, A. Srivastava. 2010.
1'st International Green Computing Conference (IGCC'10), Aug. 2010 -
Unified Datacenter Power Management Considering On-Chip and Air Temperature Constraints
B. Shi, A. Srivastava. 2011.
Special issue of Sustainable Computing: Informatics and Systems journal (SUSCOM), Feb. 2011 -
Thermal and Power-Aware Task Scheduling and Data Placement for Storage Centric Datacenters
B. Shi, A. Srivastava. 2012.
Handbook of Energy-Aware and Green Computing, Vol.1, 2012 -
Generalized Parallel Batching Framework for Serialized Workloads
K. Ashwin Kumar, C. Morrey III, A. Deshpande. 2012.
Unpublished manuscript, 2012
Acknowledgement
This material is based upon work supported in part by the National Science Foundation grants CCF-0728839 and 0937865.
Who are we?
Faculty
-
Samir Khuller (PI)
Professor
-
Amol Deshpande (Co-PI)
Associate Professor
-
Ankur Srivastava (Co-PI)
Associate Professor