Case for e-science studentships at UCL (P.Clarke, S.Bhatti, J.Crowcroft)

We request the award of one or more e-science studentships to work in the area of high rate data replication and the network services which are essential to the operation of most, if not all, Grid applications. This studentship bid is made jointly between the Physics and Computer Science departments at UCL. The two groups have matched interests, which provide an excellent base to host studentships to work in an area which is core to e-science operations and manifestly essential to the delivery of PPARC objectives. The HEP Physics group brings a very focused application (the Particle Physics and EU DataGrids) and a demonstrated interest in the networking issues associated with Grid applications. The Computer Science department brings a wealth of expertise in the entire area of data transport protocols, QoS and traffic management techniques, which will underpin all Grid operations The research programme we are following encompasses many several key areas, each of which will offer a research student(s) will gain valuable and practical experience:

High rate/High volume/Reliable Data transport applications. Grid data rates will exceed 100s of Mbit/s over long latency routes. Such transfers will rely upon the availability of high rate/high volume/reliable data transport applications. All three elements are key: the combination of rate and volume are clear, but it is also essential that such applications can recover from faults and complete the transport. The transport protocols will also have to deal with efficient replication and update to multiple sites. It is already apparent that protocols such as “standard” TCP based FTP are unlikely to be adequate and therefore today we do not know how to satisfy the DataGrid demands. This is an area where significant work has been done in the CS community based upon modeling and controlled measurements, however it is vital to understand these effects in the context of real Grid traffic patterns running on the wide area network (WAN). The work includes measurements and comparisons to theoretical models; investigation of the suitability of enhanced TCP based applications (e.g. taking account of explicit knowledge of resource availability and congestion notification) and addition of reliability strategies to the use of FTP. In the longer term there are proposals to use non-TCP based transport over the long haul parts of the route and for multi-site replication and update. The research will focus upon DataGrid testbed requirements between UK sites and from UK to CERN and the USA.

QoS and traffic management: Grid operation will require new services of the network fabric. In the immediate future  (DataGrid testbeds) there will be need for traffic management mechanisms to provide managed bandwidth over IP. There are various techniques including IP packet classification, DIFFSERV and MPLS, but none have yet been demonstrated in the Grid context and in particular across multiple administrative domains. In this context both groups are involved in a collaborative project involving UCL, Manchester, CLRC, UKERNA and which is being submitted to a leading supplier to the networking industry for support. The project aims to demonstrate traffic management services between main Grid Tier-N sites in the UK using the SuperJANET4 development network, and if possible extend this onward to CERN and the USA. This project will address multiple administrative domain issues and stress such end-to-end services using both pseudo and live Grid traffic. [Note: this work is core to all Grid activities. In this respect we expect to approach both the OST and DTI].

In the longer term Quality of Service (QoS) provision is probably crucial to HEP applications. QoS depends upon traffic classification at ingress to the network, and suitable treatment of aggregated classes at all points within the network. Eventually applications, which can respond to explicit congestion notification, must be developed. This Networked systems using IP is an area where the UCL CS group has wide and internationally recognised expertise (see http://www-mice.ucl.ac.uk/), including active participation within the IETF.

Network information services: Grid matchmaking services rely upon the availability of resource information. Such resource information includes knowledge of the static (i.e. latency, packet loss) and dynamic network conditions (i.e. on demand measurement of throughput between specific nodes).  Both groups both have prior experience, which can be used to develop a suitable information service for the DataGrid. A pilot project is underway to develop an LDAP based solution.

Data management and replication middleware: We are starting work based upon real testbed requirements in 2001, to demonstrate the use of LDAP based meta-data catalogues access to remote data from a local application using Globus services (GASS and GRAM) which link to http, ftp and other transport applications. This work will initially focus on the CDF testbed.

We have a wide programme of work directed at general e-science requirements and is by construction working toward PPARC e-science goals. The PhD studentship programme we propose would give experience in all of these areas through immersion in the combined programme. We envisage that the specific research topic to concentrate upon data transport middleware and protocols, and the provision of network information to the middleware layer. We have also a potential PhD CASE studentship programme concentrating upon the traffic engineering aspects of our programme. This will be the subject of a direct CASE application later but we would readily accommodate this now if more than one studentship is awarded (converting to a CASE later).

The studentships will be jointly supervised by Physics and CS.  The academic staff involved are:

Peter Clarke: Reader in Physics, HEP LHC group leader, active in HEP Object Oriented applications and networking for 4 years, EU-DataGrid PTB member and WP7 representative, coordinating UK network fabric activity and working on the PPARC DataGrid proposal.

S.Bhatti: Lecturer in CS, researches into QoS (applications and networks), network management, network security and mobile systems.

J. CrowCroft: Professor in CS. Research in multi-media Communications, traffic engineering and QoS. Member of the ACM and the British Computer Society, a fellow of the IEE and the royal academy of engineering and a Senior Member the IEEE.