<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Webscraping on Migus Wong</title><link>https://miguswong.github.io/tags/webscraping/</link><description>Recent content in Webscraping on Migus Wong</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>MIGUS WONG</copyright><lastBuildDate>Thu, 12 Dec 2024 14:48:13 -0600</lastBuildDate><atom:link href="https://miguswong.github.io/tags/webscraping/index.xml" rel="self" type="application/rss+xml"/><item><title>Webscraping Ironman Triathlon Results</title><link>https://miguswong.github.io/post/webscraping_iroman_results/</link><pubDate>Thu, 12 Dec 2024 14:48:13 -0600</pubDate><guid>https://miguswong.github.io/post/webscraping_iroman_results/</guid><description>
&lt;h1 id="introduction">Introduction&lt;/h1>
&lt;p>Ironman race results were scraped from a &lt;a href="https://www.coachcox.co.uk/imstats/">third-party website&lt;/a> for exploratory data analysis (EDA). All data and code used to extract the results are available at the following links:&lt;/p>
&lt;p>&lt;strong>GitHub Repository&lt;/strong> &lt;a href="https://github.com/miguswong/IronmanScraper">here&lt;/a>&lt;br>
&lt;strong>Kaggle Dataset&lt;/strong> &lt;a href="https://www.kaggle.com/datasets/miguswong/ironman-140-6-results-dataset-2002-2024">here&lt;/a>&lt;/p>
&lt;p>The files in the GitHub repository (mainly the Jupyter notebook and Python script) were used to scrape Ironman 140.6 race results from 2002 to 2024 (as of 12-05-2024). Note that the data was not scraped from the official Ironman website, but from a &lt;a href="https://www.coachcox.co.uk/imstats/">third-party website&lt;/a> not owned by Ironman.&lt;/p></description></item></channel></rss>