Contact The DL Team Contact Us | Switch to tabbed view

top of pageABSTRACT

One of the most well-studied problems in data mining is mining for association rules in market basket data. Association rules, whose significance is measured via support and confidence, are intended to identify rules of the type, “A customer purchasing item A often also purchases item B.” Motivated by the goal of generalizing beyond market baskets and the association rules used with them, we develop the notion of mining rules that identify correlations (generalizing associations), and we consider both the absence and presence of items as a basis for generating rules. We propose measuring significance of associations via the chi-squared test for correlation from classical statistics. This leads to a measure that is upward closed in the itemset lattice, enabling us to reduce the mining problem to the search for a border between correlated and uncorrelated itemsets in the lattice. We develop pruning strategies and devise an efficient algorithm for the resulting problem. We demonstrate its effectiveness by testing it on census data and finding term dependence in a corpus of text documents, as well as on synthetic data.

top of pageAUTHORS



Author image not provided  Sergey Brin

No contact information provided yet.

Bibliometrics: publication history
Publication years1995-2012
Publication count14
Citation Count4,793
Available for download5
Downloads (6 Weeks)45
Downloads (12 Months)434
Downloads (cumulative)12,359
Average downloads per article2,471.80
Average citations per article342.36
View colleagues of Sergey Brin


Author image not provided  Rajeev Motwani

No contact information provided yet.

Bibliometrics: publication history
Publication years1986-2013
Publication count176
Citation Count11,123
Available for download72
Downloads (6 Weeks)453
Downloads (12 Months)4,183
Downloads (cumulative)90,150
Average downloads per article1,252.08
Average citations per article63.20
View colleagues of Rajeev Motwani


Author image not provided  Craig Silverstein

No contact information provided yet.

Bibliometrics: publication history
Publication years1997-2003
Publication count15
Citation Count1,121
Available for download8
Downloads (6 Weeks)37
Downloads (12 Months)414
Downloads (cumulative)12,770
Average downloads per article1,596.25
Average citations per article74.73
View colleagues of Craig Silverstein

top of pageREFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
R. Agrawal, A. Arning, T. Bollinger, M. Mehta, J. Sharer, and R. Srikant. The Quest Data Mining System. In Proceedings of the Second International Conference on Knowledge Discovery in Databases and Data, August 1996.
2
 
3
 
4
 
5
 
6
A. Agresti. A survey of exact inference for contingency tables. Statistical Science, 7:131-177, 1992.
 
7
M. Dietzfelbinger, A. Karlin, K. Mehlhorn, F. Meyer auf der Heide, H. Rohnert, and H. Tarjan. Dynamic perfect hashing: Upper and lower bounds. In Proceedings of the 18th IEEE Symposium on Foundations of Computer Science, pages 524-531, 1988.
 
8
Ft. Ewald. Keynote address. The 3rd International Conference on Information and Knowledge Management, 1994.
 
9
10
11
12
 
13
J. Gray, A. Bosworth, A. Layman, and H. Pirahesh. Data cube: a relational aggregation operator generalizing groupby, cross-tab, and sub-totals. Microsoft Technical Report MSR-TR-95-22, 1995.
 
14
 
15
 
16
17
 
18
H.O. Lancaster. The Chi-squared Distribution. John Wiley & Sons, New York, 1969.
 
19
P.S. de Laplace. Oeuvres compl~tes de Laplace publides sous les auspices de i'Acad~mie des Sciences par M.M. les secrdtaires perp~tuels. Gauthier-Villar, Paris, 1878/1912.
 
20
H. Mannila, H. Toivonen, and A. Inkeri Verkamo. Efficient algorithms for discovering association rules. In Proceedings of the AAAI Workshop on Knowledge Discovery in Databases, pages 144-155, July 1994.
 
21
A. de Moivre. Approximatio ad summam terminorum binomii (a + b)n in seriem expansi. Supplement to Miscellanea Analytica, London, 1733.
 
22
D.S. Moore. Tests of chi-squared type. In: R.B. D'Agostino and M.A. Stephens (eds), Goodness-of-Fit Techniques, Marcel Dekker, New York, 1986, pp. 63-95.
 
23
F. Mosteller and D. Wallace. Inference and Disputed Authorship: The Federalists. Addison-Wesley, 1964.
24
 
25
K. Pearson. On a criterion that a given system of deviations from the probable in the case of correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philos. Mag., 5:157-175, 1900.
 
26
 
27
 
28
 
29
30

top of pageCITED BY

396 Citations

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

top of pageINDEX TERMS

The ACM Computing Classification System (CCS rev.2012)

Note: Larger/Darker text within each node indicates a higher relevance of the materials to the taxonomic classification.

top of pagePUBLICATION

· Proceeding
Title SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data table of contents
Editors Joan M. Peckman Univ. of Rhode Island, Kingston
Sudha Ram Univ. of Arizona, Tucson
Michael Franklin
Pages 265-276
Publication Date1997-06-01 (yyyy-mm-dd)
Sponsor SIGMOD ACM Special Interest Group on Management of Data
PublisherACM New York, NY, USA ©1997
ISBN: 0-89791-911-4 Order Number: 472970 doi>10.1145/253260.253327
Conference MODInternational Conference on Management of Data MOD logo
Paper Acceptance Rate 42 of 202 submissions, 21%
Overall Acceptance Rate 1,104 of 5,662 submissions, 19%
Year Submitted Accepted Rate
SIGMOD '96 290 47 16%
SIGMOD '97 202 42 21%
SIGMOD '00 248 42 17%
SIGMOD '01 293 44 15%
SIGMOD '02 240 42 18%
SIGMOD '03 342 53 15%
SIGMOD '06 446 58 13%
SIGMOD '07 480 70 15%
SIGMOD '08 435 78 18%
SIGMOD '09 430 118 27%
SIGMOD '10 384 80 21%
SIGMOD '11 375 93 25%
SIGMOD '12 289 48 17%
SIGMOD '13 372 76 20%
SIGMOD '14 421 107 25%
SIGMOD '15 415 106 26%
Overall 5,662 1,104 19%
· Newsletter
Title ACM SIGMOD Record Homepage table of contents archive
Volume 26 Issue 2, June 1997
Chairmen Sudha Ram Univ. of Arizona, Tucson
Editors Joan M. Peckham Univ. of Rhode Island, Kingston
Pages 265-276
Publication Date1997-06-01 (yyyy-mm-dd)
Sponsor SIGMOD ACM Special Interest Group on Management of Data
PublisherACM New York, NY, USA
ISSN: 0163-5808 doi>10.1145/253262.253327

APPEARS IN
Digital Content
Digital Content

top of pageREVIEWS


Reviews are not available for this item
Computing Reviews logo

top of pageCOMMENTS

Be the first to comment To Post a comment please sign in or create a free Web account

top of pageTable of Contents

Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Table of Contents
Fast parallel similarity search in multimedia databases
Stefan Berchtold, Christian Böhm, Bernhard Braunmüller, Daniel A. Keim, Hans-Peter Kriegel
Pages: 1-12
doi>10.1145/253260.253263
Full text: PDFPDF

Most similarity search techniques map the data objects into some high-dimensional feature space. The similarity search then corresponds to a nearest-neighbor search in the feature space which is computationally very intensive. In this paper, we present ...
expand
Similarity-based queries for time series data
Davood Rafiei, Alberto Mendelzon
Pages: 13-25
doi>10.1145/253260.253264
Full text: PDFPDF

We study a set of linear transformations on the Fourier series representation of a sequence that can be used as the basis for similarity queries on time-series data. We show that our set of transformations is rich enough to formulate operations such ...
expand
Meaningful change detection in structured data
Sudarshan S. Chawathe, Hector Garcia-Molina
Pages: 26-37
doi>10.1145/253260.253266
Full text: PDFPDF

Detecting changes by comparing data snapshots is an important requirement for difference queries, active databases, and version and configuration management. In this paper we focus on detecting meaningful changes in hierarchically structured data, such ...
expand
Improved query performance with variant indexes
Patrick O'Neil, Dallan Quass
Pages: 38-49
doi>10.1145/253260.253268
Full text: PDFPDF

The read-mostly environment of data warehousing makes it possible to use more complex indexes to speed up queries than in situations where concurrent updates are present. The current paper presents a short review of current indexing technology, including ...
expand
Highly concurrent cache consistency for indices in client-server database systems
Markos Zaharioudakis, Michael J. Carey
Pages: 50-61
doi>10.1145/253260.253269
Full text: PDFPDF

In this paper, we present four approaches to providing highly concurrent B+-tree indices in the context of a data-shipping, client-server OODBMS architecture. The first performs all index operations at the server, ...
expand
Concurrency and recovery in generalized search trees
Marcel Kornacker, C. Mohan, Joseph M. Hellerstein
Pages: 62-72
doi>10.1145/253260.253272
Full text: PDFPDF

This paper presents general algorithms for concurrency control in tree-based access methods as well as a recovery protocol and a mechanism for ensuring repeatable read. The algorithms are developed in the context of the Generalized Search Tree (GiST) ...
expand
Range queries in OLAP data cubes
Ching-Tien Ho, Rakesh Agrawal, Nimrod Megiddo, Ramakrishnan Srikant
Pages: 73-88
doi>10.1145/253260.253274
Full text: PDFPDF

A range query applies an aggregation operation over all selected cells of an OLAP data cube where the selection is specified by providing ranges of values for numeric dimensions. We present fast algorithms for range queries for two types of aggregation ...
expand
Cubetree: organization of and bulk incremental updates on the data cube
Nick Roussopoulos, Yannis Kotidis, Mema Roussopoulos
Pages: 89-99
doi>10.1145/253260.253276
Full text: PDFPDF

The data cube is an aggregate operator which has been shown to be very powerful for On Line Analytical Processing (OLAP) in the context of data warehousing. It is, however, very expensive to compute, access, and maintain. In this paper we define the ...
expand
Maintenance of data cubes and summary tables in a warehouse
Inderpal Singh Mumick, Dallan Quass, Barinderpal Singh Mumick
Pages: 100-111
doi>10.1145/253260.253277
Full text: PDFPDF

Data warehouses contain large amounts of information, often collected from a variety of independent sources. Decision-support functions in a warehouse, such as on-line analytical processing (OLAP), involve hundreds of complex aggregate ...
expand
Database buffer size investigation for OLTP workloads
Thin-Fong Tsuei, Allan N. Packer, Keng-Tai Ko
Pages: 112-122
doi>10.1145/253260.253279
Full text: PDFPDF

It is generally accepted that On-Line Transaction Processing (OLTP) systems benefit from large database memory buffers. As enterprise database systems become larger and more complex, hardware vendors are building increasingly large systems capable of ...
expand
Database performance in the real world: TPC-D and SAP R/3
Joachen Doppelhammer, Thomas Höppler, Alfons Kemper, Donald Kossmann
Pages: 123-134
doi>10.1145/253260.253280
Full text: PDFPDF

Traditionally, database systems have been evaluated in isolation on the basis of standardized benchmarks (e.g., Wisconsin, TPC-C, TPC-D). We argue that very often such a performance analysis does not reflect the actual use of the DBMSs in the ...
expand
The BUCKY object-relational benchmark
Michael J. Carey, David J. DeWitt, Jeffrey F. Naughton, Mohammad Asgarian, Paul Brown, Johannes E. Gehrke, Dhaval N. Shah
Pages: 135-146
doi>10.1145/253260.253283
Full text: PDFPDF

According to various trade journals and corporate marketing machines, we are now on the verge of a revolution—the object-relational database revolution. Since we believe that no one should face a revolution without appropriate armaments, this paper ...
expand
The STRIP rule system for efficiently maintaining derived data
Brad Adelberg, Hector Garcia-Molina, Jennifer Widom
Pages: 147-158
doi>10.1145/253260.253287
Full text: PDFPDF

Derived data is maintained in a database system to correlate and summarize base data which records real world facts. As base data changes, derived data needs to be recomputed. This is often implemented by writing active rules that are triggered by changes ...
expand
An array-based algorithm for simultaneous multidimensional aggregates
Yihong Zhao, Prasad M. Deshpande, Jeffrey F. Naughton
Pages: 159-170
doi>10.1145/253260.253288
Full text: PDFPDF

Computing multiple related group-bys and aggregates is one of the core operations of On-Line Analytical Processing (OLAP) applications. Recently, Gray et al. [GBLP95] proposed the “Cube” operator, which computes group-by aggregations over ...
expand
Online aggregation
Joseph M. Hellerstein, Peter J. Haas, Helen J. Wang
Pages: 171-182
doi>10.1145/253260.253291
Full text: PDFPDF

Aggregation in traditional database systems is performed in batch mode: a query is submitted, the system processes a large volume of data over a long period of time, and, eventually, the final answer is returned. This archaic approach is frustrating ...
expand
Balancing push and pull for data broadcast
Swarup Acharya, Michael Franklin, Stanley Zdonik
Pages: 183-194
doi>10.1145/253260.253293
Full text: PDFPDF

The increasing ability to interconnect computers through internet-working, wireless networks, high-bandwidth satellite, and cable networks has spawned a new class of information-centered applications based on data dissemination. These ...
expand
InfoSleuth: agent-based semantic integration of information in open and dynamic environments
R. J. Bayardo, Jr., W. Bohrer, R. Brice, A. Cichocki, J. Fowler, A. Helal, V. Kashyap, T. Ksiezyk, G. Martin, M. Nodine, M. Rashid, M. Rusinkiewicz, R. Shea, C. Unnikrishnan, A. Unruh, D. Woelk
Pages: 195-206
doi>10.1145/253260.253294
Full text: PDFPDF

The goal of the InfoSleuth project at MCC is to exploit and synthesize new technologies into a unified system that retrieves and processes information in an ever-changing network of information sources. InfoSleuth has its roots in the Carnot project ...
expand
STARTS: Stanford proposal for Internet meta-searching
Luis Gravano, Chen-Chuan K. Chang, Héctor García-Molina, Andreas Paepcke
Pages: 207-218
doi>10.1145/253260.253299
Full text: PDFPDF

Document sources are available everywhere, both within the internal networks of organizations and on the Internet. Even individual organizations use search engines from different vendors to index their internal document collections. These search engines ...
expand
On saying “Enough already!” in SQL
Michael J. Carey, Donald Kossmann
Pages: 219-230
doi>10.1145/253260.253302
Full text: PDFPDF

In this paper, we study a simple SQL extension that enables query writers to explicitly limit the cardinality of a query result. We examine its impact on the query optimization and run-time execution components of a relational DBMS, presenting ...
expand
A framework for implementing hypothetical queries
Timothy Griffin, Richard Hull
Pages: 231-242
doi>10.1145/253260.253304
Full text: PDFPDF

Previous approaches to supporting hypothetical queries have been “eager”: some representation of the hypothetical state (or the corresponding delta) is materialized, and query evaluation is filtered through that representation. This paper ...
expand
High-performance sorting on networks of workstations
Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, David E. Culler, Joseph M. Hellerstein, David A. Patterson
Pages: 243-254
doi>10.1145/253260.253322
Full text: PDFPDF

We report the performance of NOW-Sort, a collection of sorting implementations on a Network of Workstations (NOW). We find that parallel sorting on a NOW is competitive to sorting on the large-scale SMPs that have traditionally held the performance records. ...
expand
Dynamic itemset counting and implication rules for market basket data
Sergey Brin, Rajeev Motwani, Jeffrey D. Ullman, Shalom Tsur
Pages: 255-264
doi>10.1145/253260.253325
Full text: PDFPDF

We consider the problem of analyzing market-basket data and present several important contributions. First, we present a new algorithm for finding large itemsets which uses fewer passes over the data than classic algorithms, and yet uses fewer candidate ...
expand
Beyond market baskets: generalizing association rules to correlations
Sergey Brin, Rajeev Motwani, Craig Silverstein
Pages: 265-276
doi>10.1145/253260.253327
Full text: PDFPDF

One of the most well-studied problems in data mining is mining for association rules in market basket data. Association rules, whose significance is measured via support and confidence, are intended to identify rules of the type, “A customer purchasing ...
expand
Scalable parallel data mining for association rules
Eui-Hong Han, George Karypis, Vipin Kumar
Pages: 277-288
doi>10.1145/253260.253330
Full text: PDFPDF

One of the important problems in data mining is discovering association rules from databases of transactions where each transaction consists of a set of items. The most time consuming operation in this discovery process is the computation of the frequency ...
expand
Efficiently supporting ad hoc queries in large datasets of time sequences
Flip Korn, H. V. Jagadish, Christos Faloutsos
Pages: 289-300
doi>10.1145/253260.253332
Full text: PDFPDF

Ad hoc querying is difficult on very large datasets, since it is usually not possible to have the entire dataset on disk. While compression can be used to decrease the size of the dataset, compressed data is notoriously difficult to index or access. In ...
expand
DEVise: integrated querying and visual exploration of large datasets
M. Livny, R. Ramakrishnan, K. Beyer, G. Chen, D. Donjerkovic, S. Lawande, J. Myllymaki, K. Wenger
Pages: 301-312
doi>10.1145/253260.253335
Full text: PDFPDF

DEVise is a data exploration system that allows users to easily develop, browse, and share visual presentation of large tabular datasets (possibly containing or referencing multimedia objects) from several sources. The DEVise framework is being implemented ...
expand
Partitioned garbage collection of a large object store
Umesh Maheshwari, Barbara Liskov
Pages: 313-323
doi>10.1145/253260.253338
Full text: PDFPDF

We present new techniques for efficient garbage collection in a large persistent object store. The store is divided into partitions that are collected independently using information about inter-partition references. This information is maintained on ...
expand
Size separation spatial join
Nick Koudas, Kenneth C. Sevcik
Pages: 324-335
doi>10.1145/253260.253340
Full text: PDFPDF

We introduce a new algorithm to compute the spatial join of two or more spatial data sets, when indexes are not available on them. Size Separation Spatial Join (S3J) imposes a hierarchical decomposition ...
expand
Building a scaleable geo-spatial DBMS: technology, implementation, and evaluation
Jignesh Patel, JieBing Yu, Navin Kabra, Kristin Tufte, Biswadeep Nag, Josef Burger, Nancy Hall, Karthikeyan Ramasamy, Roger Lueder, Curt Ellmann, Jim Kupsch, Shelly Guo, Johan Larson, David De Witt, Jeffrey Naughton
Pages: 336-347
doi>10.1145/253260.253342
Full text: PDFPDF

This paper presents a number of new techniques for parallelizing geo-spatial database systems and discusses their implementation in the Paradise object-relational database system. The effectiveness of these techniques is demonstrated using a variety ...
expand
A toolkit for negotiation support interfaces to multi-dimensional data
Michael Gebhardt, Matthias Jarke, Stephan Jacobs
Pages: 348-356
doi>10.1145/253260.253344
Full text: PDFPDF

CoDecide is an experimental user interface toolkit that offers an extension to spreadsheet concepts specifically geared towards support for cooperative analysis of the kinds of multi-dimensional data encountered in data warehousing. It is distinguished ...
expand
Distance-based indexing for high-dimensional metric spaces
Tolga Bozkaya, Meral Ozsoyoglu
Pages: 357-368
doi>10.1145/253260.253345
Full text: PDFPDF

In many database applications, one of the common queries is to find approximate matches to a given query item from a collection of data items. For example, given an image database, one may want to retrieve all images that are similar to a given query ...
expand
The SR-tree: an index structure for high-dimensional nearest neighbor queries
Norio Katayama, Shin'ichi Satoh
Pages: 369-380
doi>10.1145/253260.253347
Full text: PDFPDF

Recently, similarity queries on feature vectors have been widely used to perform content-based retrieval of images. To apply this technique to large databases, it is required to develop multidimensional index structures supporting nearest neighbor queries ...
expand
Wave-indices: indexing evolving databases
Narayanan Shivakumar, Héctor García-Molina
Pages: 381-392
doi>10.1145/253260.253349
Full text: PDFPDF

In many applications, new data is being generated every day. Often an index of the data of a past window of days is required to answer queries efficiently. For example, in a warehouse one may need an index on the sales records of the last week for efficient ...
expand
On-line warehouse view maintenance
Dallan Quass, Jennifer Widom
Pages: 393-404
doi>10.1145/253260.253352
Full text: PDFPDF

Data warehouses store materialized views over base data from external sources. Clients typically perform complex read-only queries on the views. The views are refreshed periodically by maintenance transactions, which propagate large ...
expand
Supporting multiple view maintenance policies
Latha S. Colby, Akira Kawaguchi, Daniel F. Lieuwen, Inderpal Singh Mumick, Kenneth A. Ross
Pages: 405-416
doi>10.1145/253260.253353
Full text: PDFPDF

Materialized views and view maintenance are becoming increasingly important in practice. In order to satisfy different data currency and performance requirements, a number of view maintenance policies have been proposed. Immediate maintenance involves ...
expand
Efficient view maintenance at data warehouses
D. Agrawal, A. El Abbadi, A. Singh, T. Yurek
Pages: 417-427
doi>10.1145/253260.253355
Full text: PDFPDF

We present incremental view maintenance algorithms for a data warehouse derived from multiple distributed autonomous data sources. We begin with a detailed framework for analyzing view maintenance algorithms for multiple data sources with concurrent ...
expand
Eliminating costly redundant computations from SQL trigger executions
François Llirbat, Françoise Fabret, Eric Simon
Pages: 428-439
doi>10.1145/253260.253357
Full text: PDFPDF

Active database systems are now in widespread use. The use of triggers in these systems, however, is difficult because of the complex interaction between triggers, transactions, and application programs. Repeated calculations of rules may incur costly ...
expand
Temporal aggregation in active database rules
Iakovos Motakis, Carlo Zaniolo
Pages: 440-451
doi>10.1145/253260.253359
Full text: PDFPDF

An important feature of many advanced active database prototypes is support for rules triggered by complex patterns of events. Their composite event languages provide powerful primitives for event-based temporal reasoning. In fact, with one important ...
expand
Association rules over interval data
R. J. Miller, Y. Yang
Pages: 452-461
doi>10.1145/253260.253361
Full text: PDFPDF

We consider the problem of mining association rules over interval data (that is, ordered data for which the separation between data points has meaning). We show that the measures of what rules are most important (also called rule interest) ...
expand
Secure transaction processing in firm real-time database systems
Binto George, Jayant Haritsa
Pages: 462-473
doi>10.1145/253260.253362
Full text: PDFPDF

Many real-time database applications arise in safety-critical installations and military systems where enforcing security is crucial to the success of the enterprise. A secure real-time database system has to simultaneously satisfy who requirements guarantee ...
expand
A unified framework for enforcing multiple access control policies
Sushil Jajodia, Pierangela Samarati, V. S. Subrahmanian, Eliza Bertino
Pages: 474-485
doi>10.1145/253260.253364
Full text: PDFPDF

Although several access control policies can be devised for controlling access to information, all existing authorization models, and the corresponding enforcement mechanisms, are based on a specific policy (usually the closed policy). ...
expand
Revisiting commit processing in distributed database systems
Ramesh Gupta, Jayant Haritsa, Krithi Ramamritham
Pages: 486-497
doi>10.1145/253260.253366
Full text: PDFPDF

A significant body of literature is available on distributed transaction commit protocols. Surprisingly, however, the relative merits of these protocols have not been studied with respect to their quantitative impact on transaction processing ...
expand
Lessons from Wall Street: case studies in configuration, tuning, and distribution
Dennis Shasha
Pages: 498-501
doi>10.1145/253260.253368
Full text: PDFPDF

Consider a setting in which Database speed and reliability can make the difference between prosperity and ruin. Money for information systems is no object. Data must be accessible from many points on the globe with ...
expand
Object-relational database systems (tutorial): principles, products and challenges
Michael J. Carey, Nelson M. Mattos, Anil K. Nori
Page: 502
doi>10.1145/253260.253370
Full text: PDFPDF

Object-relational database systems, a.k.a. “universal servers,” are emerging as the next major generation of commercial database system technology. Products from relational DBMS vendors including IBM, Informix, Oracle, UniSQL, and others, ...
expand
Databases on the Web: technologies for federation architectures and case studies
Ralf Kramer
Pages: 503-506
doi>10.1145/253260.253372
Full text: PDFPDF
Data warehousing and OLAP for decision support
Surajit Chaudhuri, Umeshwar Dayal
Pages: 507-508
doi>10.1145/253260.253373
Full text: PDFPDF

On-Line Analytical Processing (OLAP) and Data Warehousing are decision support technologies. Their goal is to enable enterprises to gain competitive advantage by exploiting the ever-growing amount of data that is collected and stored ...
expand
Query optimization at the crossroads
Surajit Chaudhuri
Page: 509
doi>10.1145/253260.253374
Full text: PDFPDF
Delaunay: a database visualization system
Isabel F. Cruz, M. Averbuch, Wendy T. Lucas, Melissa Radzyminski, Kirby Zhang
Pages: 510-513
doi>10.1145/253260.253376
Full text: PDFPDF

Visual query systems have traditionally supported a set of pre-defined visual displays. We describe the Delaunay system, which supports visualizations of object-oriented databases specified by the user with a visual constraint-based ...
expand
Picture programming project
Nita Goyal, Charles Hoch, Ravi Krishnamurthy, Brian Meckler, Michael Suchow, Moshe Zloof
Pages: 514-516
doi>10.1145/253260.253377
Full text: PDFPDF
DEVise (demo abstract): integrated querying and visual exploration of large datasets
M. Livny, R. Ramakrishnan, K. Beyer, G. Chen, D. Donjerkovic, S. Lawande, J. Myllymaki, K. Wenger
Pages: 517-520
doi>10.1145/253260.253379
Full text: PDFPDF

DEVise is a data exploration system that allows users to easily develop, browse, and share visual presentations of large tabular datasets (possibly containing or referencing multimedia objects) from several sources. The DEVise framework, implemented ...
expand
SEMCOG: an object-based image retrieval system and its visual query interface
Wen-Syan Li, K. Selçuk Candan, Kyoji Hirata, Yoshinori Hara
Pages: 521-524
doi>10.1145/253260.253384
Full text: PDFPDF
The Context Interchange mediator prototype
S. Bressan, C. H. Goh, K. Fynn, M. Jakobisiak, K. Hussein, H. Kon, T. Lee, S. Madnick, T. Pena, J. Qu, A. Shum, M. Siegel
Pages: 525-527
doi>10.1145/253260.253389
Full text: PDFPDF

The Context Interchange strategy presents a novel approach for mediated data access in which semantic conflicts among heterogeneous systems are not identified a priori, but are detected and reconciled by a context mediator ...
expand
MDM: a multiple-data model tool for the management of heterogeneous database schemes
Paolo Atzeni, Riccardo Torlone
Pages: 528-531
doi>10.1145/253260.253393
Full text: PDFPDF

MDM is a tool that enables the users to define schemes of different data models and to perform translations of schemes from one model to another. These functionalities can be at the basis of a customizable and integrated CASE environment supporting the ...
expand
Template-based wrappers in the TSIMMIS system
Joachim Hammer, Héctor García-Molina, Svetlozar Nestorov, Ramana Yerneni, Marcus Breunig, Vasilis Vassalos
Pages: 532-535
doi>10.1145/253260.253395
Full text: PDFPDF

In order to access information from a variety of heterogeneous information sources, one has to be able to translate queries and data from one data model into another. This functionality is provided by so-called (source) wrappers [4,8] ...
expand
Languages for multi-database interoperability
Frédéric Gingras, Laks V. S. Lakshmanan, Iyer N. Subramanian, Despina Papoulis, Nematollaah Shiri
Pages: 536-538
doi>10.1145/253260.253397
Full text: PDFPDF
Infomaster: an information integration system
Michael R. Genesereth, Arthur M. Keller, Oliver M. Duschka
Pages: 539-542
doi>10.1145/253260.253400
Full text: PDFPDF

Infomaster is an information integration system that provides integrated access to multiple distributed heterogeneous information sources on the Internet, thus giving the illusion of a centralized, homogeneous information system. We say that Infomaster ...
expand
The InfoSleuth Project
R. J. Bayardo, Jr., W. Bohrer, R. Brice, A. Cichocki, J. Fowler, A. Halal, V. Kashyap, T. Ksiezyk, G. Martin, M. Nodine, M. Rashid, M. Rusinkiewicz, R. Shea, C. Unnikrishnan, A. Unruh, D. Woelk
Pages: 543-545
doi>10.1145/253260.253401
Full text: PDFPDF
The distributed information search component (Disco) and the World Wide Web
Anthony Tomasic, Rémy Amouroux, Philippe Bonnet, Olga Kapitskaia, Hubert Naacke, Louiqa Raschid
Pages: 546-548
doi>10.1145/253260.253402
Full text: PDFPDF

The Distributed Information Search COmponent (DISCO) is a prototype heterogeneous distributed database that accesses underlying data sources. The DISCO prototype currently focuses on three central research problems in the context of these systems. First, ...
expand
STRUDEL: a Web site management system
Mary Fernandez, Daniela Florescu, Jaewoo Kang, Alon Levy, Dan Suciu
Pages: 549-552
doi>10.1145/253260.253403
Full text: PDFPDF
GeoMiner: a system prototype for spatial data mining
Jaiwei Han, Krzysztof Koperski, Nebojsa Stefanovic
Pages: 553-556
doi>10.1145/253260.253404
Full text: PDFPDF

Spatial data mining is to mine high-level spatial information and knowledge from large spatial databases. A spatial data mining system prototype, GeoMiner, has been designed and developed based on our years of experience in the research and development ...
expand
The WHIPS prototype for data warehouse creation and maintenance
Wilburt J. Labio, Yue Zhuge, Janet L. Wiener, Himanshu Gupta, Héctor García-Molina, Jennifer Widom
Pages: 557-559
doi>10.1145/253260.253405
Full text: PDFPDF

A data warehouse is a repository of integrated information from distributed, autonomous, and possibly heterogeneous, sources. In effect, the warehouse stores one or more materialized views of the source data. The data is then readily available to user ...
expand
Structural matching and discovery in document databases
Jason Tsong-Li Wang, Dennis Shasha, George J. S. Chang, Liam Relihan, Kaizhong Zhang, Girish Patel
Pages: 560-563
doi>10.1145/253260.253406
Full text: PDFPDF

Structural matching and discovery in documents such as SGML and HTML is important for data warehousing [6], version management [7, 11], hypertext authoring, digital libraries [4] and Internet databases. As an example, a user of the World Wide Web may ...
expand
S3: similarity search in CAD database systems
Stefan Berchtold, Hans-Peter Kriegel
Pages: 564-567
doi>10.1145/253260.253407
Full text: PDFPDF

S3 is the prototype of a database system supporting the management and similarity retrieval of industrial CAD parts. The major goal of the system is to reduce the cost for developing and producing new parts by maximizing the reuse of existing parts. ...
expand
PREDATOR: an OR-DBMS with enhanced data types
Praveen Seshadri, Mark Paskin
Pages: 568-571
doi>10.1145/253260.253408
Full text: PDFPDF
Sentinel: an object-oriented DBMS with event-based rules
S. Chakravarthy
Pages: 572-575
doi>10.1145/253260.253409
Full text: PDFPDF
The MENTOR workbench for enterprise-wide workflow management
Dirk Wodtke, Jeanine Weissenfels, Gerhard Weikum, Angelika Kotz Dittrich, Peter Muth
Pages: 576-579
doi>10.1145/253260.253411
Full text: PDFPDF

MENTOR (“Middleware for Enterprise-Wide Workflow Management”) is a joint project of the University of the Saarland, the Union Bank of Switzerland, and ETH Zurich [1, 2, 3]. The focus of the project is on enterprise-wide workflow management. ...
expand
Zoo: a desktop experiment management environment
Yannis E. Ioannidis, Miron Livny, Anastassia Ailamaki, Anand Narayanan, Andrew Therber
Pages: 580-583
doi>10.1145/253260.253415
Full text: PDFPDF

Powered by The ACM Guide to Computing Literature


The ACM Digital Library is published by the Association for Computing Machinery. Copyright © 2017 ACM, Inc.
Terms of Usage   Privacy Policy   Code of Ethics   Contact Us
Did you know the ACM DL App is now available?
Did you know your Organization can subscribe to the ACM Digital Library?
The ACM Guide to Computing Literature
All Tags
Export Formats
 
 
Save to Binder