Reading list
(selected from
CMU)
The Roots
| Background: Stonebraker
| and Hellerstein, Introduction to the Section "The Roots", pages 1-4
|
- Astrahan, M. , et al., "System R: Relational Approach to Database
Management", ACM TODS, 1(2), 1976 (*)
- Chamberlin, D., et al., A
History and Evaluation of System R, Communications of the ACM,
24(10), 1981 (*)
- Stonebraker, M., et al., (1976) "The Design and Implementation of INGRES",
ACM TODS, 1(3), 1976 (*)
- Stonebraker, M., (1980) Retrospection
on a Database System, ACM TODS, 5(2), 1980
Object-Oriented and Object-Relational Database Systems
- Lamb et al., The
ObjectStore System, Communications of the ACM, 34(10), 1991 (*)
- Stonebraker, M., "Inclusion of New Types in Relational Database Systems",
Proceedings of the IEEE Conference on Data Engineering, 1986 (*)
Concurrency Control
| Background:
| Gray and Reuter, Sections 7.3 and 7.4 |
- Gray, J., et al., "Granularity of Locks and Degrees of Consistency in a
Shared Database", IFIP Working Conference on Modelling of Database
management Systems, AFIPS Press, 1976 (*)
- H. T. Kung, John T. Robinson, "On Optimistic Methods for Concurrency
Control", VLDB 1979 (*)
- Philip L. Lehman, S. Bing Yao, "Efficient Locking for Concurrent
Operations on B-Trees", ACM TODS 6(4): 650-670, 1981 (*)
- Mohan, C., ARIES/KVL: A
Key-Value Locking Method for Concurrency Control of Multiaction Transactions
Operating on B-Tree Indexes, Proceedings of the 16th VLDB
Conference, Brisbane, August 1990
- Srinivasan, V., and Carey, M.,
Performance
of B-Tree Concurrency Control Algorithms, Proceedings of the ACM SIGMOD
Conference, Denver, CO, June 1991.
Logging and Recovery
| Background:
| Gray and Reuter, Chapters 9-11 |
- Mohan, C., et al., ARIES:
A Transaction Recovery Method Supporting Fine-Granularity Locking and Partial
Rollbacks Using Write-Ahead Logging, ACM TODS, 18(1), 1991 (*)
- Mohan, C., Repeating
History Beyond ARIES, Proceedings of VLDB conference, 1999 (*)
Access Paths and Indexes
- Gray and Reuter, Section 15.4 "B-Trees" (Recommendation: Read the
whole Chapter 15) (*)
- Ramakrishnan & Gehrke, Chapter 6, "Hash-Based Indexing" (*)
Buffer Management
| Background:
| Gray and Reuter, Chapter 13 |
- Chou, H., and DeWitt, D., An Evaluation of Buffer
Management Strategies for Relational Database Systems, Proceedings of
the 11th VLDB Conference, 1985 (*)
- O'Neil, E. , O'Neil, P., and Weikum, G.,
The
LRU-K Replacement Algorithm for Database Disk Buffering, Proceedings of
the ACM SIGMOD Conference, Washington, D.C., June 1993 (*)
Query Processing and Optimization
- Shapiro, L., Join
Processing in Database Systems with Large Main Memories, ACM TODS,
11(3), September 1986 (*)
- Goetz Graefe, The Value of
Merge-Join and Hash-Join in SQL Server, Proceedings of the VLDB
Conference, September 1999 (*)
- Ioannidis, Y., Kang, Y., Randomized Algorithms
for Optimizing Large Join Queries, Proceedings of the ACM SIGMOD
Conference, Atlantic City, NJ, May 1990
- T. Urhan & M. Franklin, XJoin: A
Reactively-Scheduled Pipelined Join Operator, IEEE Data Engineering
Bulletin, June 2000, pp. 27-33
- Graefe, G., Query
Evaluation Techniques for Large Databases, ACM Computing Surveys,
25(2), June 1993 (Read only Sections 1 through 5) (*)
- Graefe, G., Dynamic query
evaluation plans: Some course corrections?, IEEE Database
Engineering 23(2), Jume 2000 (*)
- Selinger, P., et al., Access Path Selection in
a Relational Database Management System, Proceedings of the ACM SIGMOD
Conference, Boston, MA, 1979 (*)
- Chaudhuri, S., An
Overview of Query Optimization in Relational Systems, Proceedings of
the ACM PODS Conference,, Seattle, WA, 1998 (*)
- V. Poosala, P.J. Haas, Y. Ioannidis, and E.J. Shekita. Improved
histograms for selectivity estimation of range predicates, Proceedings
of the ACM SIGMOD Conference, 1996
- IEEE Database Engineering, Special issue
on Query Processing in Commercial Database Systems, 16(4), Dec. 1993
- IEEE Database Engineering,
Special issue
on Adaptive Query Processing, 23(2), Dec. 1993
DBMS and the Operating System
- Stonebraker, M., Operating
System Support for Database Management, Communications of the ACM,
24(7), 1981 (*)
- Ramakrishnan & Gehrke, Section 3.3.2, "Buffer Management in DBMS
versus OS" (*)
Disks
- Ruemmler, C., and Wilkes, J., An
Introduction to Disk Drive Modelling, IEEE Computer, 27 (3), March
1994 (*)
- Gray, J., and Graefe, G., The Five Minute
Rule Ten Years Later and Other Computer Storage Rules of Thumb, ACM
SIGMOD Record, December 1997 (*)
- Gray, J., and Shenoy, P., Rules
of Thumb in Data Engineering, Proceedings of the IEEE Conference on
Data Engineering, 2000.
Sorting
| Background:
| Common Sorting Algorithms (quicksort, binary sort, etc)
|
|
| Ramakrishnan and Gehrke Chapter 11 |
- Nyberg, C., Barclay, T., Cvetanovic, Z., Gray, J., Lomet, D., AlphaSort: A RISC
Machine Sort, Proceedings of the ACM SIGMOD Conference, May 1994
(*)
- Agarwal, R., A
super scalar sort algorithm for RISC processors, Proceedings of the ACM
SIGMOD Conference, June 1996 (*)
- Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, David E. Culler, Joseph
M. Hellerstein and David A. Patterson, High-performance
sorting on networks of workstations , Proceedings of the ACM SIGMOD
Conference, May 1997
DBMSs on Modern Processors
- A. Ailamaki, D. J. DeWitt, M. D. Hill, and D. A. Wood,
DBMSs on a
Modern Processor: Where Does Time Go?, Proceedings of the VLDB
Conference, September 1999 (*)
- K. Keeton, D. A. Patterson, Y. Q. He, R. C. Raphael, and W. E. Baker, Performance
Characterization of a Quad Pentium Pro SMP Using OLTP Workloads,
Proceedings of the 25th International Symposium on Computer
Architecture, Barcelona, Spain, June 1998 (*)
- J. L. Lo, L. A. Barroso, S. J. Eggers, K. Gharachorloo, H. M. Levy, and S.
S. Parekh. An
analysis of database workload performance on simultaneous multithreaded
processors, Proceedings of the 25th Annual International Symposium on
Computer Architecture, pages 39-50, June 1998.
Main-memory Database Systems
- Hector Garcia-Molina and Kenneth Salem, Main memory
database systems: an overview, TKDE 4(6), 1992 (*)
- Philip Bohannon et al., The architecture
of the Dali main-memory storage manager, Journal of Multimedia Tools
and Applications, 1997 (*)
More references are available in Mengzhi's last slide.
Benchmarking
| Background:
| Gray's Benchmark handbook, Chapter 1 |
- Anon et al., "A Measure of Transaction Processing Power" (in red book),
Datamation, 31(7), 1985 (*)
- Eisenberg, A., and Melton, J., Standards in
Practice, ACM SIGMOD Record, September 1998
Self-tuning Database systems
- Chaudhuri, S. and Weikum, G. Rethinking Database System
Architecture: Towards a Self-tuning RISC-style Database System,
Proceedings of the VLDB conference, September 2000 (*)
Distributed and Mobile Database Systems
- Swarup Acharya, Rafael Alonso, Michael J. Franklin, and Stanley B. Zdonik,
Broadcast
Disks: Data Management for Asymmetric Communications Environments,
Proceedings of SIGMOD Conference 1995 (*)
- Daniel Barbara, Mobile
Computing and Databases - A Survey, TKDE 11(1), 108-117, 1999 (*)
Parallel Database Systems
- D. J. DeWitt and J. Gray, Parallel Database
Systems: The Future of High Performance Database Processing,
Communications of the ACM, June 1992 (*)
- D. J. DeWitt, S. Ghandeharizadeh, D. Schneider, H. Hsiao, A. Bricker, and
R. Rasmussen, The GAMMA Database
Machine Project, IEEE Transactions on Knowledge and Data
Engineering, Vol. 2, No. 1, March 1990 (*)
- R. D. Sloan, A practical
implementation of the data base machine-Teradata DBC/1012, Proceedings
of the Twenty-Fifth Hawaii International Conference on System Sciences,
January 1992.
Data Mining
- Rakesh Agrawal, Tomasz Imielinski and Arun Swami,
Mining
Association Rules Between Sets of Items in Large Databases, Proceedings
of the ACM SIGMOD Conference, May 1993 (*)
- M. Mehta, R. Agrawal and J. Rissanen,
SLIQ:
A Fast Scalable Classifier for Data Mining, Proceedings of the Fifth
International Conference on Extending Database Technology, Avignon,
France, March 1996 (*)
Spatial Applications
- A. Guttman, R-Trees: A Dynamic Index
Structure for Spatial Searching, Proceedings of the ACM SIGMOD
Conference, 1984 (*)
- H. Samet and W.G. Aref, Spatial Data Models and Query
Processing, Modern Database Systems, 1995 (*)
- Y. Manolopoulos, E. Nardelli, A. Papadopoulos, and G. Proietti, QR-Tree: A Hybrid
Spatial Data Structure, , 1996
- J. Karlsson, hQT*: A Scalable
Distributed Data Structure for High-Performance Spatial Accesses,
Proceedings of the International Conference on Foundations of Data
Organization and Algorithms (FODO), 1998
- H. Samet, The Design and
Analysis of Spatial Data Structures, Addison-Wesley, Reading, MA,
1990
- H. Samet, The
Applications of Spatial Data Structures: Computer Graphics, Image Processing
and GIS, Addison-Wesley, Reading, MA, 1990
Semistructured Data and XML
| Background:
| S. Abiteboul, P. Buneman, D. Suciu, Data on the Web: From Relations to
Semistructured Data and XML, Morgan Kaufmann, 2000. Chapters/sections: 2,
3.1, 3.2.1, 3.2.2, 3.3.1, 4.1, 4.2 |
- D. Suciu, An
Overview of Semistructured Data, SIGACT News, 29(4):28-38, Dec 1998 (*)
- J. Shanmugasundaram, G. He, K. Tufte, C. Zhang, D. DeWitt, J. Naughton,
Relational Databases for Querying
XML Documents: Limitations and opportunities Proceedings of the 25th
VLDB, 302-314, 1999 (*)
- J. Shanmugasundaram, E. Shekita, R.Barr, M. J. Carey, B. G. Lindsay, H.
Pirahesh, B. Reinwald, Effciently Publishing Relational
Data as XML Documents, Proceedings of the 26th VLDB, 65-67, 2000
Adaptive Query Processing
- Joseph M. Hellerstein, Michael J. Franklin, Sirish Chandrasekaran, Amol
Deshpande, Kris Hildrum, Sam Madden, Vijayshankar Raman, Mehul A. Shah, Adaptive
Query Processing: Technology in Evolution, Special issue on Adaptive Query
Processing, 23(2), Dec. 1993 (*)
- Zachary G. Ives, Alon Y. Levy, Daniel S. Weld, Daniela Florescu, Marc
Friedman, Adaptive
Query Processing for Internet Applications, Special issue on Adaptive
Query Processing, 23(2), Dec. 1993 (*)
- Tolga Urhan and Michael J. Franklin, XJoin:
A Reactively-Scheduled Pipelined Join Operator, Special issue on Adaptive
Query Processing, 23(2), Dec. 1993 (*)
- Luc Bouganim, Françoise Fabret, and C. Mohan, A
Dynamic Query Processing Architecture for Data Integration Systems,
Special issue on Adaptive Query Processing, 23(2), Dec. 1993 (*)
Here is a (tentative) list of papers that covered in Hot Topics in Database
Systems course at CMU. Most papers are from the latest conferences (VLDB, SIGMOD,
ICDE, etc) but there may be some older ones. Paper suggestions are welcome. For
the majority of the papers there are hyperlinks to the electronic version (PS or
PDF), mostly hosted by the ACM Digital Library.
Query Optimization
- Selinger, P., et al., Access Path Selection in a Relational Database Management System, ACM
SIGMOD 1979.
- V. Poosala, P.J. Haas, Y. Ioannidis, and E.J. Shekita.
Improved histograms for selectivity estimation of range predicates,
Proceedings of the ACM SIGMOD Conference, 1996
- M. Stillger, et. al., LEO: DB2's LEarning Optimizer. VLDB 2001.
Supplementary reading
Self-tuning Database Systems
- S. Finkelstein, et. al. Physical Database Design for Relational Databases. TODS 13(1): 91-128
(1988)
- Agrawal S., Chaudhuri S. and Narasayya V.,
Automated Selection of Materialized Views and Indexes for SQL Databases.
Proceedings of the 26th International Conference on Very Large Databases
(VLDB00), Cairo, Egypt, 2000, pp. 496-505, 2000
- G.Lohman, G.Valentin, D.Zilio, M.Zuliani, A. Skelly,
DB2 Advisor: An Optimizer Smart Enough to Recommend Its Own Indexes,
Proceedings, 16th IEEE Conference on Data Engineering, San Diego, CA, 2000
- J.Rao, C.Zhang, G.Lohman, N.Megiddo, Automating Physical Database Design in a Parallel Database System,
Proceedings, 2002 ACM SIGMOD, Madison WI
Supplementary reading
- G. Weikum, et. al. Self-tuning Database Technology and Information Services: from Wishful
Thinking to Viable Engineering, Proceedings of the VLDB conference,
August 2002
- S. Chaudhuri and G. Weikum. Rethinking Database System Architecture: Towards a Self-tuning RISC-style
Database System, Proceedings of the VLDB conference, September 2000
- Chaudhuri S. and Narasayya V, An Efficient Cost-Driven Index Selection Tool for Microsoft SQL Server.
Proceedings of the 23rd International Conference on Very Large Databases
(VLDB97), Athens, Greece, 1997, pp. 146-155, 1997
- "Index Tuning Wizard for Microsoft SQL Server 2000", SQL Server White
Papers,
- H. Gupta, V. Harinaxayan, A. Itajazaman, J. D. Ullman.
Index Selection in a Self Adaptive Database
Management System Proceedings, 1976 ACM SIGMOD Conference
Stream and Adaptive Query Processing
- T. Urhan, et. al. Cost-based Query Scrambling for Initial Delays. Sigmod 1998.
- S. Babu and J. Widom. Continuous Queries over Data Streams, Sigmod Record 30(3), 2001.
- S. Madden et. al., Continuously Adaptive Continuous Queries over Streams, SIGMOD 2002
- S. Chandrasekaran and M. Franklin.
Streaming Queries over Streaming Data, VLDB 2002
- D. Carney, et. al. Monitoring Streams - A New Class of Data Management Applications, VLDB
2002
- Gibbons and Tirthapura. Distributed Streams Algorithms for Sliding Windows, SPAA 2002.
Supplementary reading
- Datar, Gionis, Indyk, Motwani. Maintaining stream statistics over sliding windows
- Gibbons, P. Distinct sampling for highly-accurate answers to distinct values queries and
event reports, VLDB 2001
- Philippe Bonnet, Anthony Tomasic.
Partial Answers for Unavailable Data Sources, FQAS 1998: 43-54
- Anthony Tomasic, Hector Garcia-Molina.
Caching and Database Scaling in Distributed Shard-Nothing Information
Retrieval Systems, SIGMOD 1993
- B. Babcock, et. al. Models and Issues in Data Stream Systems, PODS 2002
- Joseph M. Hellerstein, Michael J. Franklin, Sirish Chandrasekaran, Amol
Deshpande, Kris Hildrum, Sam Madden, Vijayshankar Raman, Mehul A. Shah,
Adaptive Query Processing: Technology in Evolution, Special issue on
Adaptive Query Processing, 23(2), Dec. 1993
- Zachary G. Ives, Alon Y. Levy, Daniel S. Weld, Daniela Florescu, Marc
Friedman,
Adaptive Query Processing for Internet Applications, Special issue on
Adaptive Query Processing, 23(2), Dec. 1993
- Tolga Urhan and Michael J. Franklin,
XJoin: A Reactively-Scheduled Pipelined Join Operator, Special issue on
Adaptive Query Processing, 23(2), Dec. 1993
- J. M. Hellerstein, P. J. Haas, and H. J. Wang.
Online Aggregation. ACM SIGMOD 1997.
- P. J. Haas, J. M. Hellerstein.
Ripple Joins for Online Aggregation. ACM SIGMOD 1999.
Semistructured Data and XML
- J.McHugh, S.Abiteboul, R.Goldman, D.Quass, J.Widom
Lore: A Database Management System for Semistructured Data Sigmod Record,
Sept.1997(*)
- J. Shanmugasundaram, J.Kiernan, E.Shekitam, C.Fan, J.Funderbruk
Querying XML Views of Relational Data, Proceedings of the VLDB2001
Supplementary Reading
- S. Abiteboul, P. Buneman, D. Suciu, Data on the Web: From Relations to
Semistructured Data and XML, Morgan Kaufmann, 2000. Chapters/sections: 2, 3.1,
3.2.1, 3.2.2, 3.3.1, 4.1, 4.2
- R. Goldman, J. McHugh, and J. Widom. From Semistructured Data to XML: Migrating the Lore Data Model and Query
Language Proceedings of the 2nd International Workshop on the Web and
Databases (WebDB '99), Philadelphia, Pennsylvania, June 1999
- J. McHugh and J. Widom. Query Optimization for XML Proceedings of the Twenty-Fifth International
Conference on Very Large Data Bases, Edinburgh, Scotland, September 1999
- J. Shanmugasundaram, E. Shekita, R.Barr, M. J. Carey, B. G. Lindsay, H.
Pirahesh, B. Reinwald, Effciently Publishing Relational Data as XML Documents, Proceedings of
the 26th VLDB, 65-67, 2000
- J. Shanmugasundaram, G. He, K. Tufte, C. Zhang, D. DeWitt, J. Naughton,
Relational Databases for
Querying XML Documents: Limitations and opportunities Proceedings of
the 25th VLDB, 302-314, 1999
Distributed/Peer-to-peer databases
Background Reading
Required Reading for Class
- Gray, Helland, O'Neil, Shasha The Dangers of Replication and a Solution Proceedings of the ACM SIGMOD
Conference, 1996 (15-721 Slides on
replication )
- Ratnasamy, Francis, Handley, Karp, Schenker
A Scalable Content-Addressable Network
Proceedings of SIGCOMM 2001
- (SAME DAY) Gribble, Halevy, Ives, Rodrig, Suciu
What can Peer-to-Peer Do for Databases, and Vice Versa?
Supplementary Reading
Architecture Conscious Database Systems
Background Reading
- Hennesy J., Patterson D, "Computer Architecture: A Quantitative Approach"
Required Reading for Class
- A. Ailamaki, D.J. DeWitt, M.D. Hill, and D.A. Wood.
DBMSs on a Modern Processor: Where Does Time Go?
In proceedings of the
25th International Conference on Very Large Data Bases (VLDB), Edinburgh, UK,
September 1999
- Stefan Manegold, Peter Boncz, Martin L. Kersten
Generic Database Cost Models for Hierarchical Memory Systems
In
Proceedings of VLDB 2002
Supplementary Reading
- Kimberly Keeton, David A. Patterson, Yong Qiang He, Roger C. Raphael,
Walter E. Baker Performance Characterization of a Quad Pentium Pro SMP Using OLTP Workloads
In Proceedings of International Symposium on Computer Architecture (ISCA)
1998
- A. Ailamaki, D.J. DeWitt, M.D. Hill, and M. Skounakis.
Weaving Relations for Cache Performance
Procesdings of the 27th
International Conference on Very Large Data Bases (VLDB), Roma, Italy,
September 2001
- P. A. Boncz, S. Manegold, M. L. Kersten.
Database Architecture Optimized for the New Bottleneck: Memory Access
In
Proceedings of the International Conference on Very Large Data Bases (VLDB),
pp 54-65, Edinburgh, United Kingdom, September 1999
- Ravishankar Ramamurthy, David J. DeWitt, Qi Su
A Case for Fractured Mirrors In Proceedings of VLDB 2002
- Shimin Chen, Phillip B. Gibbons, Todd C. Mowry, and Gary Valentin.
Fractal Prefetching B+-Trees: Optimizing Both Cache and Disk Performance
In Proceedings of SIGMOD 2002
- Shimin Chen , Phillip B. Gibbons , and Todd C. Mowry
Improving Index Performance through Prefetching
In Proceedings of SIGMOD
2001
|