Data Centric Networking and Systems (2012-2013 Lent Term)
|
OverviewThis module provides an introduction to data centric
networking and systems, where data is a communication token in networking
and its impact to the computer system's architecture. Data centric
networking in distributed systems relies on content addressing instead of host
addressing, thus providing network independence for applications. Integration of
complex data processing with networking is a key vision for future computing.
Module StructureThe module consists of 8 sessions, of which 6 sessions focus on a specific aspect of the topic in data centric networking and systems research. Each session discusses 2-3 papers, led by the assigned students. Each student will present about 2 paper reviews during the course. The first session advises how to read/review a paper and a brief introduction of different perspectives in data centric networking. The last session is dedicated to the presentation of the open source project studies present by the students. One hands-on session on data-flow programing and one guest lectures are planned (subject to change), covering inspiring current research in the data centric networking and systems domain. Schedule and Reading ListWe’ll meet in SW01 every Tuesday (from January 22 to March 12) in 2013. The time slot is 14:00-16:00 on Tuesday. 2013/01/22 Session 1: Introduction to Data Centric Networking and Systems (DCN)
2013/01/29 Session 2: Content-Centric Networking (CCN) and Content Distribution Networks (CDN)
Ross Lagerwall (slides) 2.1. V. Jacobson, D. Smetters, J. Thornton, M. Plass, N. Briggs, R. Braynard: Networking Named Content, CoNEXT, 2009. 2.2. V. Jacobson, D. Smetters, J. Thornton, M. Plass, N. Briggs, R. Braynard: Networking Named Content, CACM, January, 2012. Valentin Dalibard (PhD Student) (slides) 3. A. Ghodsi, T. Koponen, B. Raghavan, S. Shenker, A. Singla, and J. Wilcox: Information-Centric Networking: Seeing the Forest for the Trees, HotNets, 2011. 4. P. Jokela, A. Zahemszky, C. E. Rothenberg, S. Arianfar, and P. Nikander: LIPSIN: Line Speed Publish/Subscribe Inter-networking, SIGCOMM, 2009. Leandro Nicosia (slides) 5. George Xylomenos, Xenofon Vasilakos, Christos Tsilopoulos, Vasilios A. Siris, and George C. Polyzos: Caching and Mobility Support in a Publish-Subscribe Internet Architecture, IEEE Communication, Vol 50, Issue 7, 2012. 6. Md. F. Bari, S. Chowdhury, R. Ahmed, R. Boutaba, and B. Mathieu: A Survey of Naming and Routing in Information-Centric Networks, IEEE Communication, Vol 50, Issue 7, 2012.
1.1. A. Carzaniga, D.S. Rosenblum, A.L. Wolf:
Achieving
scalability and expressiveness in an internet-scale event notification service, PODC, 2001. 1.3. A. Carzaniga, A.L. Wolf: Forwarding in a content-based network, SIGCOMM, 2003. 2. M. Castro, M. B. Jones, A-M. Kermarrec, A. Rowstron, M. Theimer, H. Wang and A. Wolman: An Evaluation of Scalable Application-level Multicast Built Using Peer-to-peer overlays, INFOCOM, 2003.
3.1. S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker:
A scalable
content addressable network, SIGCOMM, 2001. 2013/02/05 Session 3: Programming in Data Centric Environment
1. Yuan Yu, Michael Isard, D. Fetterly, M. Budiu, U.
Erlingsson, P.K. Gunda, J. Currey:
DryadLINQ: A System for General-Purpose
Distributed Data-Parallel Computing Using a High-Level Language, OSDI, 2008.
Laurie James
(slides)
Bogdan-Alexandru Matican
(slides) Karthik Nilakant (PhD Student) including Naiad demo. (slides)
Frank McSherry's Talk Video is
here. Twitter Demo is
here. 6. Frank McSherry, Rebecca Isaacs, Michael Isard, and Derek G. Murray, Composable Incremental and Iterative Data-Parallel Computation with Naiad, no. MSR-TR-2012-105, 2012.
Albert Kim
(slides) 8. Dionysios Logothetis, Christopher Olston, Benjamin Reed, Kevin Webb and Kenneth Yocum: Stateful Bulk Processing for Incremental Analytics, SOCC, 2010. 2013/02/12 Session 4: MapReduce Handson Tutorial using CIEL with Amazon EC2
2013/02/19 Session 5: Stream Data Processing and Data/Query Model
1. V. Gulisano, R. Jimenez-Peris, M. Patiño-Martinez, P.
Valduriez:
StreamCloud: A Large Scale Data Streaming System, ICDCS, 2010.
Brett Lagerwall
(slides)
Georgiev Petko
(slides)
2013/02/26 Session 6: Graph Structured Data: Network, Storage, and Graph Processing
1. J. Pujol, V.
Erramilli, G. Siganos, X.
Yang, N. Laoutaris, P.
Chhabra, P. Rodriguez:
The Little Engine(s) That Could: Scaling Online Social
Networks, SIGCOMM, 2010.
Bogdan-Alexandru Matican (slides)
Leandro Nicosia
(slides)
Brett Lagerwall
(slides)
Albert Kim
(slides) 8. Dimitrios Prountzos Roman Manevich Keshav Pingali: Elixir: A System for Synthesizing Concurrent Graph Programs, OOPSLA, 2012. 2013/03/05 Session 7: Network holds Data in Delay Tolerant Networks (DTN)
1.1 E. Nordström, P. Gunningberg, C. Rohner:
A Search-based
Network Architecture for Mobile Devices, Uppsala University Technical Report
2009-003, 2009.
Georgiev Petko (slides) 5. A. Balasubramanian, R. Mahajan, and A. Venkataramani: Augmenting mobile 3g using wifi, MobiSys, 2010. 6. K. Lee, I. Rhee, J. Lee, S. Chong, and Y. Yi: Mobile data offloading: how much can wifi deliver? Co-NEXT, 2010. Compact routing
(Optional)
5.1. Dmitri Krioukov, kc claffy, Kevin Fall, Arthur Brady:
On compact routing for
the internet, ACM 37 (3), 2007. 15:45-16:00 Wrap-up Discussion
The reading club will require you to read between 1
and 3 papers every week. You need to fill out a review_log
(MS word format,
text
format) except sections 6&7 prior to each session and email me by 12:00 noon on
Monday. The minimum requirement of review_log is one per session, but
you can read as many as you want and fill the review_log for each paper you
read. The following three reports are required, which could
be extended from the reading assignment of the reading club or a different one
within the scope of data centric networking. The reports 1 and 2 should be handed in by the end of
5th week (Feb 19, 2013 - 12:00 noon ) and
7th week (March 18, 2013 - 12:00 noon) of
the course (not in any particular order). The report 3 should be by the end of
the Lent term (March 26, 2013 - 12:00 noon). The final grade for the course will be provided as a
letter grade or percentage and the assessment will consist of two parts: See the candidates of
Open Source Projects in data centric networking. The list
is not exhausted. If you take anything other than the one in the list, please
discuss with me. The purpose of this assignment is to understand the prototype
of the proposed architecture, algorithms, and systems through running an actual
prototype and present/explain to the other people how the prototype runs, any
additional work you have done including your own applications and setup process
of the prototype. This experience will give you better understanding of the
project. These Open Source Projects come with a set of published papers and you
should be able to examine your interests in the paper through running the
prototype. Some projects are rather large and may require extensive environment
and time; make sure you are able to complete this assignment. The following papers aid how to read/review a paper.
Further supplement: see ‘how to read/review a paper’
section in
Advanced Topics in Computer Systems by Steven Hand.
Presentations should be about 20-25 minutes long,
where you need to cover the following aspects. What are the background and the problem domain of
the paper? What is the motivation of the presented work? What is the
difference from the existing works? What is the novel idea? How did the paper
change/unchange the research in the research community? What is the significant contribution? How did the
authors tackle the problem? Did the authors obtain expected result from their
trial? The following document aids in presenting a review. Perdita Stevens in University of Edinburgh
How to
[read, present, review] a research paper. A survey paper provides the readers with an exposition of
existing work that is comprehensive and organized. It must expose relevant
details associated in the surveying area, but it is important to keep a
consistent level of details and to avoid simply listing the different works.
Thus a good survey paper should demonstrate a summary of recent research results
in a novel way that integrates and adds understanding to work in the field. For
example, you can take an approach by classifying the existing literature in your
own way; develop a perspective on the area, and evaluate trends. Thus, after
defining the scope of your survey, 1) classify and organize the trend, 2)
critical evaluation of approaches (pros/cons), and 3) add your analysis or
explanation (e.g. table, figure). Also adding reference and pointer to further
in-depth information is important (summary from Rich Wolski’s note).
Please email to
eiko.yoneki@cl.cam.ac.uk for your
submission of course work or any question. |