Recent Progress in Data Integration
Daniela Florescu (INRIA Rocquencourt)
Alon Levy (University of Washington)
Managing
Multimedia Information in a Database Environment
William I. Grosky (Wayne State University)
Invited Speakers:
Rules...and what's next?
Towards Second Generation Data Mining Systems
Tomasz
Imieliñski (Rutgers
University)
Workflow Management
in the Internet Age
C. Mohan (IBM Almaden Research
Center, INRIA Rocquencourt)
Monday, September 7 | ||
08:30 - 10:00 | Tutorial 1 | |
Recent Progress in Data
Integration
Daniela Florescu (INRIA Rocquencourt), Alon Levy (University of Washington) |
||
10:00 - 10:30 | Coffee Break | |
10:30 - 12:00 | Continuation of Tutorial 1 | |
12:00 - 12:30 | Coffee Break | |
12:30 - 14:00 | Continuation of Tutorial 1 | |
14:00-15:30 | Lunch Break | |
15:30-17:00 | Tutorial 2 | |
Managing Multimedia
Information in a Database Environment
William I. Grosky (Wayne State University) |
||
17:00-17:30 | Coffee Break | |
17:30-19:00 | Continuation of Tutorial 2 |
Tuesday, September 8 | ||
09:00 - 09:30 | Opening Session | |
09:30 - 10:30 | Keynote Talk | |
Rules...and what's next?
Towards Second Generation Data Mining Systems Tomasz Imielinski (Rutgers University) |
||
10:30 - 11:00 | Coffee Break | |
11:00 - 12:30 | Session 1: Query Languages | |
Untyped Queries, Untyped
Reflective Machines and Conditional Quantifiers
Jose Maria Turull Torres |
||
Containment of Conjunctive
Queries with Built-in Predicates with Variables and Constants over any
Ordered Domain
Nieves R. Brisaboa, Hector J. Hernandez, Jose R. Parama, Miguel R. Penabad |
||
Using Queries with Multi-Directional
Functions for Numerical Database Applications
Staffan Flodin, Kjell Orsborn, Tore Risch |
||
12:30-14:30 | Lunch Break | |
14:30-16:00 | Session 2: Optimization | |
Multiple Range Query
Optimization in Spatial Databases
Apostolos N. Papadopoulos, Yannis Manolopoulos |
||
Optimizing Command Logs
by Exploiting Semantic Knowledge
Roland Baumann |
||
A Distributed Algorithm
for Global Query Optimization in Multidatabase Systems
Silvio Salza, Giovanni Barone, Tadeusz Morzy |
||
16:00-16:30 | Coffee Break | |
16:30-18:00 | Session 3: Collaborative Systems | |
Transaction Management
in Databases Supporting Collaborative Applications
Waldemar Wieczerzycki |
||
Object-Oriented Design
of a Flexible Workflow Management System
Mathias Weske |
||
Extending Transaction
Closures by N-ary Termination Dependencies
Kerstin Schwarz, Can Tuerker, Gunter Saake |
||
19:30 | Reception | |
Wednesday, September 9 | ||
09:00-10:00 | Invited Talk | |
Workflow Management
in the Internet Age
C. Mohan (IBM Almaden Research Center) |
||
10:00-10:30 | Coffee Break | |
10:30-12:30 | Session 4: East Meets West | |
Distributed Information
Systems
Ralf Kramer, Peter C. Lockemann |
||
Integrated Web - Database
Applications for Electronic Business
Wojciech Cellary |
||
Term Weighting in Query-Based
Document Clustering
Kai Korpimies, Esko Ukkonen |
||
Discovery of Object-Oriented
Schema and Schema Conflicts
Hele-Mai Haav, Mihhail Matskin |
||
On the Ordering of Rewrite
Rules
Joachim Kroeger, Stefan Paul, Andreas Heuer |
||
Towards Data
and Object Modelling
Jaroslav Pokorny |
||
12:30-14:30 | Lunch Break | |
14:30-16:00 | Session 5: Schema Integration | |
Propagation of Structural
Modifications to an Integrated Schema
Regina Motz |
||
Integration of Schemas
Containing Data Versions and Time Components
Bogdan D. Czejdo, Maciej Matysiak, Tadeusz Morzy |
||
Deriving Relationships
between Integrity Constraints for Schema Comparison
Can Tuerker, Gunter Saake |
||
16:00-16:30 | Coffee Break | |
16:30-18:00 | Session 6: Storage and Version Management | |
A Database Interface
Integrating a Querying Language for Versions
Eric Andonoff, Gilles Hubert, Annig Le Parc |
||
The nP-Tree: Region
Partitioning and Indexing for Efficient Path Planning
Lusiana Nawawi, Janusz R. Getta, Phillip J. McKerrow |
||
Replication in Mirrored
Disk Systems
Athena Vakali, Yannis Manolopoulos |
||
19:30 | Choir Concert | |
Thursday, September 10 | ||
09:00-10:30 | Session 7: Object Systems | |
Clustering Techniques
for Minimizing Object Access Time
Vlad S. Wietrzyk, Mehmet A. Orgun |
||
Designing Persistence
for Real-Time Distributed Object Systems
Igor Nekrestyanov, Boris Novikov, Ekaterina Pavlova |
||
Partial Replication
of Object-Oriented Databases
Michael Dobrovnik, Johann Eder |
||
10:30-11:00 | Coffee Break | |
11:00-13:00 | Session 8: Knowledge Discovery and the Web | |
Optimizing Knowledge
Discovery over the WWW
Matthew Montebello |
||
Data Mining Query Language
for Object-0riented Database
Vladimir Novacek |
||
Itemset Materializing
for Fast Mining of Association Rules
Marek Wojciechowski, Maciej Zakrzewicz |
||
Schema Derivation for
WWW Information Sources and their Integration
with Databases in Bioinformatics
Michael Hoeding, Ralf Hofestaedt, Gunter Saake, Uwe Scholz |
||
13:00-15:00 | Lunch Break | |
15:00-16:30 | Session 9: System Design | |
Component-based Information
Systems Development Tool Supporting the SYNTHESIS
Design Method
Dmitry O. Briukhov, Leonid A. Kalinichenko |
||
Translating Relational
Queries to Object-Oriented Queries According to ODMG-93
Ahmed Mostefaoui, Jacques Kouloumdjian |
||
A Flexible Framework
for a Correct Database Design
Donatella Castelli, Serena Pisani |
||
16:30-17:00 | Coffee Break | |
17:00-18:30 | Session 9: Industrial Track | |
Human Resources Information
Systems Improvement: Involving Financial Systems
and Other Sources Data
Sergey Zykov |
||
Physical Structures
Design For Relational Databases
Janusz Charczuk |
||
Modeling of Census Data
in a Multidimensional Environment
Holger Guenzel, Wolfgang Lehner, Stein Eriksen, Jon Folkedal |
TUTORIALISTS AND INVITED SPEAKERS
Daniela Florescu is a researcher in the Rodin group in INRIA Rocquencourt. She received her B.Sc. in Computer Science and Mathematics from the University of Bucharest in 1991, Masters of Computer Science from University of Paris VI in 1992, and a Ph.D. in Computer Science from INRIA and the University of Paris VI in 1996. From November 1996 till December 1997 she was a senior member of the technical staff at AT&T Research Laboratories. Her current research interests are information integration, query optimization in objectoriented database systems, query execution models for parallel databases, query reformulation in multidatabase systems, semistructured data and website management systems.
Alon Levy is an assistant professor in the University of Washington in Seattle. He received his B.Sc. in Computer Science and Mathematics from the Hebrew University in Jerusalem in 1988, and his Ph.D. in Computer Science from Stanford University in 1993. From 1993 to 1997 he was a principal member of the technical staff at AT&T Research Laboratories. His current research interests are information integration, semistructured data, materialized views, website management systems, knowledge representation and connections between database systems and Artificial Intelligence.
William I. Grosky is currently professor and chair of the Computer Science Department at Wayne State University in Detroit, Michigan. His current research interests are in multimedia information systems, hypermedia, image databases, and web technology. Dr. Grosky received in B.S. in mathematics from MIT in 1965, his M.S. in Applied Mathematics from Brown University in 1968, and his Ph.D. in Engineering and Applied Sciences from Yale University in 1971. Serving on many database and multimedia conference program committees, he is currently the Editor-in-Chief of IEEE Multimedia magazine and on the editorial boards of the Journal of Database Management and Pattern Recognition.
Tomasz Imieliñski is currently Professor and Chairman of the Department of Computer Science, Rutgers University in New Brunswick NJ, USA. He has received his Ph.D. from Polish Academy of Science (Warsaw) in 1982. His initial work dealt with the issues of representation and querying of databases with incomplete information. His current interests include database mining and mobile wireless computing. Dr. Imielinski is currently Director of Mobile Computing Laboratory (DataMan) at Rutgers University which is sponsored by DARPA, NSF and a number of companies. Since coauthoring the original paper introducing association rules Dr. Imielinski has been leading the development of "Discovery Board" the experimental data mining system developed at Rutgers University. He has been active in numerous program committees of conferences such as SIGMOD, VLDB, KDD and Mobicomm and is currently an Associate Editor of ACM/Baltzer Nomad - journal of Wireless and Mobile Communications and Computing, and Knowledge Discovery Journal (Kluwer). He has edited three books, including "Mobile Computing", (T. Imielinski, H.Korth, Kluwer 1996). Dr. Imielinski is a cofounder and chief technology officer of Hevelius Software - a company which develops data mining applications.
Dr. C. Mohan, after graduating from the Indian Institute of Technology at Madras in 1977 and the University of Texas at Austin in 1981, joined the IBM Almaden Research Center. In June 1997, he was named an IBM Fellow for being recognized worldwide as a leading innovator in database transaction management. He received the 1996 ACM SIGMOD Innovations Award in recognition of his innovative contributions to the development and use of database systems. Since late 1996, he has been leading the Dominotes project whose goal is to enhance Lotus Domino/Notes by introducing transactional recovery. Earlier, he led the Exotica workflow management project which was focused on IBM's FlowMark, MQSeries and Lotus Notes. During 6/98-6/99, he is on a sabbatical at INRIA, Rocquencourt (France). Mohan has received numerous IBM awards: 1 Corporate Award, 7 Outstanding Innovation Awards, 2 Research Division Awards and the 9th Plateau Invention Achievement Award for patent activities (28 issued, 4 pending). He was a designer and an implementor of the R* distributed DBMS, the Starburst extensible DBMS and DB2. His algorithms have been implemented in several IBM and non-IBM products, and university prototypes. He is the primary inventor of the ARIES family of recovery and locking methods, and the industry-standard Presumed Abort commit protocol. His research interests include concurrency control, recovery, commit protocols, index management, semi-structured data management, query optimization, active databases, OODBMSs, workflow, and distributed systems. He was the Americas Program Chair for the 1996 International Conference on Very Large Data Bases, the Program Chair of the 1987 International Workshop on High Performance Transaction Systems, and a Program Vice-Chair of the 1994 International Conference on Data Engineering. He is an editor of the VLDB Journal, and Distributed and Parallel Databases - An International Journal.
Recent progress in data integration
In the last few years there has been
considerable interest in the problem of providing access to large collections
of distributed
heterogeneous information sources (e.g.,
sources on the World-Wide Web, company-wide databases). This interest has
spawned a significant amount of research in Database Systems and related
fields (e.g., Artificial Intelligence, Operating Systems, Human Computer
Interaction). This has led to the development of several research prototypes
for information integration and recently, we are seeing the beginnings
of an industry addressing this problem.
The goal of this tutorial is to survey
the work on information integration, to illustrate the common principles
underlying this body
of work, to assess the state of the
art, and identify the open research problems in this area. The tutorial
will illustrate the issues
involved in information integration
through several implemented systems.
Managing Multimedia Information in a Database Environment
We discuss multimedia information management from the point of view of database systems: how the various aspects of database design and the modules of a database system have evolved over the years to better manage multimedia information. We start by examining the nature of multimedia data and the area of multimedia data modeling. We then discuss how multimedia data has influenced the evolution of the various modules of a standard database system; specifically, those having to do with query processing, choice of access methods, query optimization, transaction management, buffer management, storage management, recovery, and security. Finally, we consider the various commercial systems that have recently appeared which manage multimedia information.