Posts Tagged ‘data’

Strava users help sports science – unwittingly

20/11/2013

Do you use Strava? If so, you may have contributed unwittingly to pioneering research that could help all cyclists.

Three Italian researchers accessed the data of almost 30,000 Strava-using cyclists. (Put simply, Strava is the social fitness app that tracks your ride and creates a leaderboard for all rides on the same route) The users were anonymous to the researchers so it could be anyone’s data – including your’s.

SocialBlog_In-Situ-1024x592Then the researchers mined this mountain of data. They wanted to see what kind of exercise leads to better performances.

Sports scientists, doctors, physiologists and others have been doing the same kind of research for decades. They’ve come up with a lot of credible theories – but they’ve been based on the results from a few dozen professional or, occasionally, a few hundred experienced participants.

Those old studies look tiny compared to the cohort used by Paolo Cintia, Luca Pappalardo and Dino Pedreschi of the Dept. of Informatics at the University of Pisa, Italy. Thanks to the Strava data, they had an enormous sample size of 29,284 cyclists to study. The vast majority of them would have been amateur (that’s you and me) and their fitness levels would have ranged from the near-elite to the pathetic (me).

In the old days the quantity of data would’ve been too much information to handle easily but, with every second of those riders’ activities in stored in digital form, the researchers were able to drill down relatively quickly.

Fortunately for all sports scientists and coaches, the findings from the huge sample corroborate what’s suspected already from the old, small studies. Exercise on its own doesn’t make you perform better; it’s down to training.

“Athletes that better improve their performance follow precise training patterns usually referred as overcompensation theory, with alternation of stress peaks and rest periods,” say Cintia, Pappalardo and Pedreschi.

“To the best of our knowledge, our study is the first corroboration on large scale of this theory, mainly confirming that “engine matters”, but tuning is fundamental,” they say.

Sadly the potential for data from Strava and other social fitness platforms to help scientists get new insights is now restricted. Paul Mach, an engineer at Strava and creator of Raceshape, implied by tweet that the researchers had only acquired the data because they had “hammered the v1/v2 API before it got shutdown. We blocked an Italian univ[ersity] IP a while back. Probably them.”

Screen shot 2013-11-20 at 15.55.01

This approach by Strava gives the company more control over who can access the anonymised data of its users. “Data acquiring has been a fundamental part of our work,” the Italian researchers say, “We did it through Strava’s API (version 2.0)*. Unfortunately, Strava changed his [sic] API policies in June 2013, so it is not possible to download data anymore.

“At that time, we asked Strava and they were still developing the new version of API. Now it seems that such new version is finally available but you need to request the access to Strava developers.”

Neveindexrtheless, expect more results soon from the Strava data that Cintia, Pappalardo and Pedreschi harvested before the tap was turned off. “Currently we are investigating other fascinating aspects emerging from Strava data, we hope to get new results for helping cyclists in their training life. Specially because we are cyclists, too,” they say.

And if Strava relents and let’s scientists get fresh data, it’s likely to be just one source of information that could benefit all of us, including the most pathetic riders (me) in unexpected ways. “We are sure that the increasing diffusion of training devices (powermeters, heart rate monitor etc) and social fitness applications will give us the possibility of a deep and new study of training science,” say Cintia, Pappalardo and Pedreschi.

Their research paper is available here and they were scheduled to present it in December 2013 at a workshop of the IEEE International Conference on Data Mining in Dallas, Texas

+++++++++++++

*Update at 16:56 GMT 20/11/13: From Paolo Cintia – “The data acquisiton was done after a request to Strava developer, explicitly highlighting the scientific and anonymous use. Furtherrmore the access to the API was public.”

Update at 09:00 GMT 5/3/14: From the researchers: “We are currently working on some improvements and extension of the cyclists’ study. Our purpose is to create a model able to detect if a person trains in the right way or not. In the meanwhile, we opened a blog where we tell in a divulgative way our scientific works. You can find a post about the cyclists’ study here

Advertisements