Tues/Thurs. 3:30 p.m. - 4:45 p.m.
|
|
Who to
contact for a hard copy of papers:
We are located in A.V. Williams Building, room 4157 |
Class Location |
J.M. Patterson Building, Room 1109 Bldg. Number: 083 Location: Northeast Quad D-2 |
Please Note: There will be a room change as of February 3rd. We will meet in the UMIACS Conference Room, located in the A.V. Williams Building, room 2120. |
Tools
and Techniques for Very Large Scale Data Intensive
Applications The course is a survey of database systems that target sensor, scientific and statistical applications as well as systems used for data cube and data mining analyses. We will evaluate database architectures for their ability to efficiently support data access and computational requirements posed by high end applications. The course will cover algorithmic, systems, API and user interface issues. We will also carry out a targeted survey of data intensive applications and their requirements for database support to provide the applications context for evaluating high end database technology. |
Dates | Presentations |
Wk 1: 1/24 1/26 |
|
Chialin Chang January 26 |
Chang, C., Acharya, A., Sussman, A., and Saltz, J. T2: A Customizable Parallel Database for Multi-dimensional Data. Technical Report CS-TR-3867 and UMIACS-TR-98-04, University of Maryland, Department of Computer Science and UMI ACS, January (1998). To appear in ACM SIGMOD Record, March 1998. ftp://hpsl.cs.umd.edu/pub/papers/ADR-tr.ps.Z. |
Chialin Chang January 26 |
Chang, C., Moon, B., Acharya, A., Carter S., Sussman, A., and Saltz, J. Titan: A High Performance Remote-Sensing Database. Proceedings of the 1997 International Conference on Data Engineering, 375--384, April 1997. ftp://hpsl.cs.umd.edu/pub/papers/icde97-final.ps.Z |
Renato Ferreira January 26 |
Ferreira, R., Moon, B., Humphries, J., Sussman, A., Saltz, J., Miller, R., and Demarzo, A. The Virtual Microscope. In Proceedings of the 1997 AMIA Annual Fall Symposium, 449-453. American Medical Informatics Association, October 1997. ftp://hpsl.cs.umd.edu/pub/papers/amia97.ps.Z. |
Wk 2 &
3: 2/3 2/5 2/10 2/12 |
Earth Science and Medical Databases Client--Server Paradise and Geo-Spatial DBMS slides SEQUOIA slides and Sequoia Benchmark |
Mike Beynon February 3 |
|
Hubert Tsang February 5 |
|
Norina Dixon February 10 |
|
John Davis February 12 |
|
Wk 4:
2/17 2/19 |
Parallel Database Systems and Query Optimization |
Jerome Brown February 17 |
DeWitt, D. and Gray, J. Parallel Database Systems: The Future of High Performance Database Systems. Communications of the ACM, 35(6), 85--98. June 1992. |
Nonetta Pierre February 19 |
Graefe, G. Query
Evaluation Techniques for Large Databases. ACM
Computing Surveys 25(2), 73-170. June 1993. file://ftp.cs.pdx.edu/pub/faculty/graefe/papers/qeval.survey.ps |
Wk 5:
2/24
2/26 |
Tertiary Storage |
Yuan-Shin Hwang February 24 |
Prabhakar S., Agrawal D., Abbadi A.E., and Singh A. Tertiary Storage: Current Status and Future Trends. Computer Science Department, University of California, Santa Barbara TRCS96-21, August 1996.http://www.cs.ucsb.edu/TRs/techreports/TRCS96-21.ps |
Renato Ferreira February 24 |
Sarawagi, S. and Stonebraker, M. Reordering Query Execution in Tertiary Memory Databases. In Proceedings of the 22nd VLDB Conference,156--167, Morgan Kaufmann Publishers, Inc. 1996. http://SunSite.Informatik.RWTH-Aachen.DE/dblp/db/conf/vldb/SarawagiS96.html |
Renato Ferreira February 26 |
Cabrera, L.-F., Rees, R., and Hineman, W. Applying Database Technology in the ADSM Mass Storage System. In Proceedings of the 21st VLDB Conference, 597-605. Morgan Kaufmann Publishers, Inc., 1995. |
Renato Ferreira February 26 |
Yu, J. and DeWitt, D. J. Query Pre-Execution and Batching in Paradise: A Two-Pronged Appraoach to the Efficient Processing of Queries on Tape-Resident Data Sets. In 9th International Conference on Scientific and Statistical Database Management (SSDBM '97). IEEE Computer Society Press, 1997. http://www.cs.wisc.edu/paradise/paradise.papers.html. |
Wk
6: 3/3 3/5 |
Client-Server |
Rob Bennett March 3 March 5 |
|
Wk
7: 3/10
3/12 |
Object-Relational Database Systems Of Objects and Database slides and Enhanced Abstract Data Types slides On-line Analytical Processing (OLAP) |
Asmara Afework March 10 |
|
Charlie
Chang March 12 |
|
Wk
8: 3/17 3/19 |
On-line
Analytical Processing (OLAP) OLAP Data and Multidimensional Aggregates slides Faculty Candidate Talk |
Henrique Andrade March 17 |
|
Faculty Candidate Talk March 19 |
Department Lecture Series SPRING 1998 Speaker: Amin Vahdat Affiliation: UC - Berkeley Location: AVW 3258 Time: 4:00 p.m. Thursday, Mar 19 (Refreshments at 3:30 in AVW 1152) Title: Operating System Services For Wide-Area Applications Abstract: This talk examines system support issues for wide-area applications given the opportunity posed by remotely programmable resources. The development of a number of compelling wide-area applications such as Internet commerce, remote agents, online gaming, and news transmission has helped us identify a common set of application requirements, including: (i) naming of remote, potentially migrating objects, (ii) coherent access to global data, (iii) safe execution of remote programs, and (iv) secure, authenticated access to global resources. Unfortunately, today such system support is implemented in an ad-hoc and application-specific manner. This talk describes some of the difficulties of developing wide-area applications and describes the design and implementation of WebOS, a unified set of system services designed to simplify application development and to more efficiently utilize wide-area resources. One demonstration of WebOS functionality is Rent-A-Server, a system that allows any Web server to dynamically replicate itself across the wide area in response to client access patterns. |
Wk
9: 3/24 3/26 |
spring break! |
Wk
10: 3/31 4/2 |
Parallel Mining slides |
Shamik
Sharma March 31 |
|
Mustafa
Uysal April 2 |
|
Wk
11: 4/7 4/9 |
|
Asmara
Afework April 7 |
|
Norina
Dixon April 9 |
|
Wk
12: 4/14 4/16 |
Indexing |
Henrique
Andrade April 14 |
|
Jerome
Brown April 16 |
|
Wk
13: 4/21 4/23 |
Systems |
Anthony Tomasic April 23 |
Department of Computer Science Colloquium Speaker: Anthony Tomasic (INRIA Rocquencourt & Dyade) [Anthony Tomasic is a faculty candidate in the CS department] Date: Tuesday, April 21 Time: 3:30pm, reception following at 4:30pm Room: AV Williams 2460 Title: Parachute Queries Abstract: Mediator systems (aka heterogeneous databases) are used today in a wide variety of unreliable environments. When processing a query, a mediator may try to access a data source which is unavailable. In this situation, existing systems suffer from an Achilles' heel -- they typically either silently ignore unavailable data sources or generate an error. In either case, to obtain the complete answer, the query is reprocessed from scratch. This behavior is inefficient in environments with a non-negligible probability that a data source is unavailable (e.g., the Internet). In the case that some data sources are unavailable, the complete answer to a query cannot be obtained; however useful work can be done with the available data sources. In this talk, after some suitable marketing, we describe a novel approach to mediator query processing where, in the presence of unavailable data sources, the answer to a query is a `partial answer.' The partial answer represents the state of the mediator at the end of query processing, i.e., materialized data. This state is used to construct an `incremental query.' The answer to the incremental query is the same as the complete answer, but it is more efficient to evaluate than the original query. In addition, information can be extracted from the mediator state through the use of secondary queries, called `parachute queries.' We define two new architectures for partial answers, incremental and parachute queries and analyze several properties of these architectures. Our analysis shows that parachute queries can be viably added to existing mediator systems. Joint work with Philippe Bonnet (Bull Inc. & Dyade) |
Nonetta Pierre April 23 |
|
Wk
14: 4/28 4/30 |
|
Leana Golubchik April 28 |
Gemmell, D. J., Vin, H. M., and Kandlur, D. D., Rangan, P. V., and Rowe, L. A. Multimedia Storage Servers: A Tutorial IEEE Computer, 28(5), 40--49, May 1995. http://www.research.microsoft.com/research/BARC/JGemmell/computer95.ps |
Mike Franklin April 30 |
|
Wk
15: 5/5 5/7 |
Video Databases and Query by Image slides Systems |
John
Davis May 5 |
|
Mustafa Uysal May 7 |
|
Wk
16: 5/12 Last Day |