{"id":92,"date":"2018-07-28T12:05:37","date_gmt":"2018-07-28T10:05:37","guid":{"rendered":"http:\/\/egert.org\/blog\/?p=92"},"modified":"2018-12-27T07:34:42","modified_gmt":"2018-12-27T06:34:42","slug":"what-pro-data-analysis-can-learn-from-strava","status":"publish","type":"post","link":"https:\/\/egert.org\/blog\/2018\/07\/28\/what-pro-data-analysis-can-learn-from-strava\/","title":{"rendered":"What Pro Data Analysis can learn from Strava"},"content":{"rendered":"<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/www.strava.com\"><img loading=\"lazy\" decoding=\"async\" class=\"alignright wp-image-96\" src=\"http:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/strava_logo_orange.png\" alt=\"\" width=\"281\" height=\"115\" srcset=\"https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/strava_logo_orange.png 408w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/strava_logo_orange-300x123.png 300w\" sizes=\"(max-width: 281px) 100vw, 281px\" \/><\/a>If you are a cyclist or a runner, chances are that you use or have at least heard of Strava. Strava is a platform for athletes to analyze and share their activities\u2019 data and virtually compete with one another.<\/p>\n<p><span style=\"font-weight: 400;\">With the rise of sports tracking devices that trace position and pace via GPS and additionally measure heart rate and elevation, Strava leverages the data its users upload to create a frame of reference for several types of activities. Founded in 2009, Strava has more than 10 million active members (in fact, they make a point of not calling them users) in more than 195 countries. In 2017 alone, cyclists shared over 7.3 billion km worth of rides.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In 2009, when they began as a typical Californian data start-up, they were highly dependent on the hardware vendor Garmin: in the beginning, uploading data to Strava was only possible directly from a Garmin device, leaving early Strava at the mercy of Garmin. 
Today, the power dynamics have changed a lot: it is now Strava compatibility that drives hardware sales. Automatic data synchronisation to Strava, or even live Strava-powered analytics during an activity, helps not only Garmin but also competitors like Polar and Wahoo to sell their newest generations of devices.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">How do they make money?<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Strava\u2019s main source of revenue is, first of all, its premium membership options (\u20ac59.99 a year or \u20ac7.99 a month). <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Secondly, industry partners can sponsor challenges, that is, specific goals within a specific time frame that Strava members can commit to in order to motivate themselves. One example is the Rapha 500 Challenge [1] of the bike vendor Rapha, which challenges its participants to ride 500 km between Christmas Eve and New Year\u2019s Eve.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">As Strava is a Big Data company, selling data to third parties is also part of its business. For now, they are committed to sharing data only in an aggregated and therefore anonymized form with partners that are aligned with Strava\u2019s vision of enabling and helping athletes. Notably, the project Strava Metro [2] aims to partner with city planners around the globe to make, for example, bike paths and the most frequented bike tracks safer. On their website, you can find a case study of the partnership with the Seattle Department of Transportation.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">As of 2018, Strava has yet to become profitable.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">My personal use cases<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">To navigate on my bike following pre-planned tracks, I bought a Garmin Edge 800 a few years ago. For this, I create a GPX (GPS Exchange Format) file of my route and upload it to my Garmin device. 
<\/span><\/p>\n<figure id=\"attachment_98\" aria-describedby=\"caption-attachment-98\" style=\"width: 649px\" class=\"wp-caption alignleft\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-98\" src=\"http:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.53.04-1024x678.png\" alt=\"\" width=\"649\" height=\"430\" srcset=\"https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.53.04-1024x678.png 1024w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.53.04-300x199.png 300w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.53.04-768x509.png 768w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.53.04-1200x795.png 1200w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.53.04.png 1640w\" sizes=\"(max-width: 649px) 100vw, 649px\" \/><figcaption id=\"caption-attachment-98\" class=\"wp-caption-text\">GPX track covering both the Gampen Pass and the Mendel Pass in Italy, planned on GPSies<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">GPX is an XML schema that can be used to record GPS-based waypoints and routes together with timestamps. The format can store pre-planned routes for later navigation as well as the timestamps recorded when passing these waypoints on a bike ride or a run. I use my Garmin for both navigation and recording, but any smartphone can do the trick as well (within its battery limitations).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When I ride, I have my track as a purple line embedded into a map [3] that I can follow to pick the right turns. After my ride, I use Garmin\u2019s own software, Garmin Express, to read out the recorded GPS\/time data as well as my heart rate. It is automatically transferred to the Garmin platform Garmin Connect. 
Garmin Connect offers features similar to Strava\u2019s while being restricted to Garmin\u2019s own devices. In my opinion, its dashboard composition used to be a bit messy. The new modern look has improved matters quite a lot, but it came too late: many users like myself had already gone looking for alternatives.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Garmin exposes the data of newly created activities to Strava via an API, so any ride is uploaded automatically and becomes visible to my community of friends and acquaintances. Over there, I get an instant analysis of how I performed on pre-defined segments during my ride: Did I hit a personal best? Did I make the top 10 for women? How do I rank compared to my friends who have ridden this particular segment as well? Subsequently, my activity becomes visible to my friends (or to the world if I so choose).<\/span><\/p>\n<figure id=\"attachment_99\" aria-describedby=\"caption-attachment-99\" style=\"width: 752px\" class=\"wp-caption alignright\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-99 size-large\" src=\"http:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.55.47-752x1024.png\" alt=\"\" width=\"752\" height=\"1024\" srcset=\"https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.55.47-752x1024.png 752w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.55.47-220x300.png 220w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.55.47-768x1046.png 768w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/Bildschirmfoto-2018-07-28-um-11.55.47.png 922w\" sizes=\"(max-width: 752px) 100vw, 752px\" \/><figcaption id=\"caption-attachment-99\" class=\"wp-caption-text\">Activity on Strava<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">From a data analysis perspective, Strava does a few things well from which the pro 
data analysis world could benefit as well.<\/span><\/p>\n<p>&nbsp;<\/p>\n<h2><span style=\"font-weight: 400;\">Easy and powerful visualizations and tracking tools geared to its user base yield powerful Business Intelligence<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">One of the main challenges for most amateur athletes is keeping up the motivation to continue with one\u2019s sport. On the one hand, it\u2019s the community part that allows you to share your passion, as well as your challenges, with real-life friends such as your bike club or with other like-minded people that you know only online.\u00a0<\/span><\/p>\n<p>On the other hand, you can follow your own progress and try to beat your past self. I particularly like the feature of tracking the number of weekly activities, the overall length and elevation gain to motivate myself to keep up my rides and training.<\/p>\n<p>Strava, of course, benefits from the fact that there are a lot of canonical KPIs for sports activities, such as distance covered, speed, heart rate and elevation gain, which make it quite easy for a sports tracking platform&#8217;s insights to be relevant and meaningful for the user.<\/p>\n<p>Neatly visualizing this data in a way adapted to the needs of the particular type of sport is, on the other hand, much more difficult. In my opinion, Strava&#8217;s success is mainly due to its strength there, outperforming Garmin with a less cluttered interface and visuals.<\/p>\n<h2><span style=\"font-weight: 400;\">Well incentivized community dynamics keep the platform and its data relevant<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">At the heart of Strava\u2019s data analysis capabilities are segments. Segments are short tracks of variable length between two points which cover a part of a road or a route. A typical example would run from the start of a mountain slope to its highest point. 
Users can create segments themselves, but also flag segments as duplicates or as irrelevant (for example, sprints of only a couple of meters).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Even though Strava has recently invested in getting rid of the most obvious duplicate segments, it mostly relies on the community to do its own clean-up: a lot of Strava members develop quite some enthusiasm for curating the most relevant segments that appear on their routes in order to track and showcase their performances.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The same is true for fraudulent data: If you track your \u201cperformance\u201d on an e-bike or a motorcycle in order to score a good ranking on an at least moderately frequented segment, you can be sure that other top 10 candidates will be quick to report the activity to get a \u201cfair\u201d ranking.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This principle of the community acting as both police and curator helps avoid one of the most common threats to any Big Data endeavour, namely the loss of meaning of data due to spam and irrelevance.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">Gather your stars as marketers <\/span><\/h2>\n<p><span style=\"font-weight: 400;\">While some KOMs and QOMs (short for King of the Mountain and Queen of the Mountain, denoting the respective leader on a segment) are pretty much out of reach once a major competition has traversed one\u2019s territory, it is undeniably cool that one can also follow people like Romain Bardet (who is competing in the Tour de France at the moment) and see how they perform on your favourite segment. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Below is a screenshot of a segment that I just rode. It was part of this year\u2019s edition of the Giro d\u2019Italia, which makes it possible to compare the world-class cyclists Romain Bardet and Vincenzo Nibali. 
<\/span><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-94\" src=\"http:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/strava-foza-pass-1024x446.png\" alt=\"\" width=\"1024\" height=\"446\" srcset=\"https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/strava-foza-pass-1024x446.png 1024w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/strava-foza-pass-300x131.png 300w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/strava-foza-pass-768x334.png 768w, https:\/\/egert.org\/blog\/wp-content\/uploads\/2018\/07\/strava-foza-pass-1200x522.png 1200w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p><span style=\"font-weight: 400;\">Having the industry\u2019s stars on one\u2019s platform is a great marketing coup: it showcases one\u2019s functionality and gives professional athletes a platform to interact with their fans. <\/span><\/p>\n<h2><span style=\"font-weight: 400;\">A well-thought-out Premium membership principle<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">Currently, I use the free membership option of Strava. 
The premium option would give me more detailed analysis, such as power meter analysis, live feedback and personalized coaching to reach more customized goals.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">One fun example of what could be gained from a premium membership is the possibility of getting live segment information during my ride: I would see exactly how I\u2019d need to perform to score a good ranking on, say, my favorite hill.\u00a0<\/span><span style=\"font-weight: 400;\">Strava\u2019s philosophy is that most people will sign up for the free option and then steadily move to the premium option once they have been with Strava for a while.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">And even while you are not paying, you are still contributing to the richness of the data accumulated and curated in Strava.\u00a0<\/span><span style=\"font-weight: 400;\">While Strava has yet to reach profitability, this balance seems to be quite a powerful way for Strava to generate value for both members and partners.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The big key word for the future of Strava is \u2018Discovery\u2019: Assume you travel to a new city and you want to go for a run. Strava knows your typical distance and whether you like hilly terrain or flats, and can recommend routes that other athletes just like you run in this particular city.\u00a0<\/span><span style=\"font-weight: 400;\">To what extent this will be part of the premium offering is not yet clear, but to me these kinds of recommendations would be very valuable and something I would definitely consider paying for.<\/span><\/p>\n<h2><span style=\"font-weight: 400;\">Grow with challenges<\/span><\/h2>\n<p><span style=\"font-weight: 400;\">As a data company\/social media platform, you are under constant public scrutiny. 
At the beginning of the year, a story broke of a secret US military air base [5] being exposed on a Strava heat map: Soldiers had been recording their training as \u2018public\u2019 on Strava. Even though the data was anonymized, a well-frequented running course in the Afghan desert left little room for speculation.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Even though one can clearly argue that this incident was largely due to the carelessness of the people uploading their data publicly without a second thought, it remains a challenge for a community to educate its members about the consequences of their privacy choices. This holds both for Strava itself and for mainstream journalism, which mistakenly called this a \u2018data leak\u2019 or \u2018data breach\u2019, which it most definitely wasn\u2019t. <\/span><\/p>\n<p><span style=\"font-weight: 400;\">Strava itself took action to highlight the opt-out options in detail and, to avoid such incidents, introduced a minimum number of activities required for a path to show up on any heat map. Furthermore, heat maps are refreshed regularly so that activities that are later made private no longer show up. This means that even if a group of soldiers mistakenly uploads an activity from a secret location, they can still take action to have it hidden, and further damage can be avoided.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Another ongoing discussion that concerns a far greater base of users is the possibility of opting out of certain aspects of data sharing. Unfortunately, many athletes, in particular women, are not comfortable sharing timed location data of their runs publicly, since it could be very easy for stalkers or even attackers to guess patterns and pose a serious threat. For now, the only option is not to share an activity publicly at all. 
Strava has said that they are currently exploring how to make only parts of an activity publicly visible while still integrating the other relevant parts of the activity. \u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<h2>Notes<\/h2>\n<p><span style=\"font-weight: 400;\">[1] Rapha 500 Challenge, see e.g. the 2016 version here: https:\/\/www.strava.com\/challenges\/rapha-festive500-2016<\/span><\/p>\n<p>[2] Strava Metro: Insights business for city planning\u00a0<a href=\"https:\/\/metro.strava.com\/\">https:\/\/metro.strava.com\/<\/a><\/p>\n<p><span style=\"font-weight: 400;\">[3] For planning, I use <\/span><a href=\"http:\/\/www.gpsies.com\"><span style=\"font-weight: 400;\">www.gpsies.com<\/span><\/a><span style=\"font-weight: 400;\">. Strava offers its own planning tool, Strava routes <\/span><a href=\"https:\/\/www.strava.com\/routes\"><span style=\"font-weight: 400;\">https:\/\/www.strava.com\/routes<\/span><\/a><span style=\"font-weight: 400;\">. For somewhat arbitrary historical reasons, however, this tool has not yet gained much traction in my circle of friends: it once recommended that one of our more passionate road cyclists take a less than optimal gravel path on his precious road bike.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">[4] Open Street Map <\/span><a href=\"https:\/\/www.openstreetmap.org\/\"><span style=\"font-weight: 400;\">https:\/\/www.openstreetmap.org\/<\/span><\/a><span style=\"font-weight: 400;\"> is an open source mapping database that permits downloading any map selection in Garmin-compatible formats<\/span><\/p>\n<p><span style=\"font-weight: 400;\">[5] BBC article on the Strava Military Base incident from 28\/01\/2018 https:\/\/www.bbc.com\/news\/technology-42853072<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>&nbsp; If you are a cyclist or a runner, chances are that you use or have at least heard of Strava. 
Strava is a platform for athletes to analyze and share their activities\u2019 data and virtually compete with one another. With the rise of sports tracking devices tracing position and pacing via GPS and additionally [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[38,35,37,34],"class_list":["post-92","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-b2c","tag-bi","tag-big-data","tag-strava"],"_links":{"self":[{"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/posts\/92"}],"collection":[{"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/comments?post=92"}],"version-history":[{"count":5,"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/posts\/92\/revisions"}],"predecessor-version":[{"id":102,"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/posts\/92\/revisions\/102"}],"wp:attachment":[{"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/media?parent=92"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/categories?post=92"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/egert.org\/blog\/wp-json\/wp\/v2\/tags?post=92"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}