Sprint: Speculative Prefetching of Remote Data (SPLASH 2011 - OOPSLA)

Fri 21 - Thu 27 October 2011 Portland, Oregon, United States

Who

Arun Raman, Greta Yorsh, Martin Vechev, Eran Yahav

Track

SPLASH 2011 OOPSLA

Abstract

Remote data access latency is a significant performance bottleneck in many modern programs that use remote databases and web services. We present Sprint - a run-time system for optimizing such programs by prefetching and caching data from remote sources in parallel to the execution of the original program. Sprint separates the concerns of exposing potentially-independent data accesses from the mechanism for executing them efficiently in parallel or in a batch. In contrast to prior work, Sprint can efficiently prefetch data in the presence of irregular or input-dependent access patterns, while preserving the semantics of the original program.

We used Sprint to automatically improve the performance of several real-world Java programs that access remote databases (MySQL, DB2) and web services (Facebook, IBM’s Yellow Pages). Sprint achieves speedups ranging 2.4x to 15.8x over sequential execution, which are comparable to those achieved by manually modifying the program for asynchronous and batch execution of data accesses. Sprint provides a simple interface that allows a programmer to plug in support for additional data sources without modifying the client program.

DOI

https://doi.org/10.1145/2048066.2048088

Sprint: Speculative Prefetching of Remote Data

Arun RamanAuthor

Greta YorshAuthor

Jane Street

Martin VechevAuthor

ETH Zurich, Switzerland

Eran YahavAuthor

Technion

Israel

Tracks

Co-hosted Conferences

Workshops

Co-hosted Symposia