Topics: Big data in networks; Traffic characterization and traffic models for networks
Authors: Charles Thomas, Jeff Kline and Paul Barford (ComScore, USA)
Presenter bio: Charles Thomas currently lives in Madison, Wisconsin, USA. He spent 26 years at the University of Wisconsin, Madison, where he was developed novel research computing and large-scale network management softwar systems. Charles designed and built the high-performance JavaScript telemetry tag that was central to data collection for the fraud detectio company, MdotLabs. In 2015, MdotLabs was acquired by comScore, a leadin web metrics company, where he now works. Currently he is engaged in research and software development in the AdTech problem space.
Abstract:
Collecting information from web clients
without explicit input from users is important in a variety of contexts
including content customization, experience personalization, accounting
and online advertising. A standard approach for gathering web client
telemetry is through deployment of Javascript instrumentation that is
placed either directly on web pages or through third-party "tags" that
are referenced in web pages. In this paper we present a design study of
web client measurement methods. The objective of our work is to enhance
understanding of best practices in web client measurement toward the
goal of developing future tags that are reliable, robust and efficient.
We begin by conducting a detailed examination of Javascript
instrumentation collected from five well known third party services. Our
analysis shows that these code-bases have diverse capabilities and
return a broad range of client characteristics. Next, we describe a web
client measurement framework and an implementation that we call
IntegraTag, which enables us to examine details of performance, accuracy
and reliability through live deployments. We use IntegraTag to conduct
case studies of tag behavior on a single website which resulted in over
500K page-loads, and on a publisher network which resulted in over 150M
page-loads. We report on the wide range of client characteristics
returned by IntegraTag, as well as it's performance and robustness.