To students enrolled in the DIS course for the 2008-09 winter
semester: the course website is maintained in Moodle; the exact link
LSIR course, winter semester 2007/2008 (last year's web page)
Distributed Information Systems
This course introduces in detail several key technologies underlying today's
distributed information systems. After introducing nonstandard data models
nowadays in use on the Web for information representation, we learn about
various aspects of processing this information at increasing levels of
abstraction, starting from the physical aspects of managing distributed
data up to the extraction of new information from existing data by means
of data mining. The specific focus will be on managing Web and mobile
We assume that students are familiar with the course on relational databases.
This year, we will have weekly exercises in which we provide questions and datasets (input/output) for selected algorithmic problems. Participation is optional, and the exercises will be neither corrected nor graded. We will provide and explain the solutions the following week. We will also provide exercises and exam questions from previous years.
Bonus Intermediate Exams
We will have two written intermediate bonus exams, each of which counts for 25%. An intermediate exam counts only if its grade is better than the grade in the final exam.
Permitted material: lecture slides, exercises, handwritten notes.
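The bonus rule above can be illustrated with a small sketch. It is only an interpretation, not the official grading formula: it assumes that each intermediate exam that beats the final exam grade contributes 25% of the overall grade, and that the final exam covers all remaining weight. The function name `combined_grade` and its parameters are hypothetical.

```python
def combined_grade(final, intermediates, weight=0.25):
    """Combine grades under an ASSUMED reading of the bonus rule.

    Each intermediate exam counts for `weight` of the overall grade,
    but only if it is strictly better than the final exam grade.
    The final exam grade covers whatever weight is left (assumption).
    """
    total = 0.0
    remaining = 1.0
    for grade in intermediates:
        if grade > final:  # a bonus exam can only help, never hurt
            total += weight * grade
            remaining -= weight
    return total + remaining * final


if __name__ == "__main__":
    # First bonus exam beats the final, the second does not.
    print(combined_grade(4.0, [5.0, 3.0]))
```

Under these assumptions, a final exam grade of 4.0 with intermediate grades of 5.0 and 3.0 gives 0.25 × 5.0 + 0.75 × 4.0 = 4.25, while two intermediate grades below the final leave the final grade unchanged.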
There will be a final written exam. It will consist of conceptual questions similar to those posed throughout the lectures and of examples similar to those from the exercises. Permitted material: lecture slides, exercises, handwritten notes, basic calculator.
Time and Place
Lecture: Tuesday 8h00am-10h00am, room
Exercise: Tuesday 10h00am-11h00am, room
News and announcements
- There will be a Q&A session before the final exam on Jan 17th, 2008, from 16h00 to 18h00, in BC03.
- Solutions to the 2nd intermediate
exam are available
here. To see your grades, go to Wojciech's office (BC 143)
on Monday, Wednesday, or Friday afternoon, from 14h00 to 15h00
(until December 21st). Otherwise, send an email to
for an appointment.
- A blog has been created at
to facilitate the discussion among students and assistants on
- Slides of Week 11
- More detailed
explanations for the exercises on P2P networks are
Note: The contents of the lectures might be (slightly) updated
before the class.
- M. Tamer Özsu, Patrick Valduriez: Principles of Distributed Database Systems, Second Edition, Prentice Hall, ISBN 0-13-659707-6, 1999.
- S. Abiteboul, P. Buneman, D. Suciu: Data on the Web: From Relations to Semistructured Data and XML, Morgan Kaufmann, 2000.
- Ricardo Baeza-Yates, Berthier Ribeiro-Neto: Modern Information Retrieval (ACM Press Series), Addison Wesley, 1999.
- Jiawei Han: Data Mining: Concepts and Techniques, Morgan Kaufmann, 2000, ISBN 1-55860-489-8.
- P. Baldi, P. Frasconi, P. Smyth: Modeling the Internet and the Web, Wiley 2003
- Daniel Barbará: Mobile Computing and Databases - A Survey. TKDE 11(1): 108-117 (1999)
- Swarup Acharya, Rafael Alonso, Michael J. Franklin, Stanley B. Zdonik: Broadcast Disks: Data Management for Asymmetric Communications Environments. SIGMOD Conference 1995: 199-210
- Sohail Hameed, Nitin H. Vaidya: Log-Time Algorithms for Scheduling Single and Multiple Channel Data Broadcast. MOBICOM 1997: 90-99
- Tomasz Imielinski, S. Viswanathan, B. R. Badrinath: Data on Air: Organization and Access. TKDE 9(3): 353-372 (1997)
- Ion Stoica, Robert Morris, David Karger, Frans Kaashoek, Hari Balakrishnan. Chord: A Scalable Peer-To-Peer Lookup Service for Internet Applications. Proceedings of the ACM SIGCOMM, 2001.
- Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker. A Scalable Content-Addressable Network. Proceedings of the ACM SIGCOMM, 2001.
- M.A. Jovanovic, F.S. Annexstein, and K.A. Berman. Scalability Issues in Large Peer-to-Peer Networks - A Case Study of Gnutella. University of Cincinnati, Laboratory for Networks and Applied Graph Theory, 2001.
- Frank Dabek, Emma Brunskill, M. Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan. Building Peer-to-Peer Systems With Chord, a Distributed Lookup Service. Proceedings of the 8th Workshop on Hot Topics in Operating Systems (HotOS-VIII), 2001.
- Ian Clarke, Oskar Sandberg, Brandon Wiley, and Theodore W. Hong. Freenet: A Distributed Anonymous Information Storage and Retrieval System. Designing Privacy Enhancing Technologies: International Workshop on Design Issues in Anonymity and Unobservability. LNCS 2009. Springer Verlag, 2001.
- The Free Network Project
- Karl Aberer. P-Grid: A self-organizing access structure for P2P information systems. Proceedings of the Sixth International Conference on Cooperative Information Systems (CoopIS 2001), 2001.
- Michael W. Berry, Susan T. Dumais, Gavin W. O'Brien. Using Linear Algebra for Intelligent Information Retrieval. Department of Computer Science, University of Tennessee, Knoxville, Dec. 1994.
- Gio Wiederhold. Mediators in the architecture of future information systems. IEEE Computer Magazine, March 1992.
- L. Liu, L.L. Yan, M.T. Özsu. Interoperability in large scale distributed information delivery systems.