Commit 5163f2f

Merge branch 'main' into gh-pages
2 parents 82c36d0 + ed42394 commit 5163f2f

1 file changed

Lines changed: 18 additions & 50 deletions

index.html

@@ -3113,7 +3113,7 @@ <h3 class="mt-4 display-6">Web Crawling and Scraping</h3>
 <div class="col-12 col-6 col-sm-6 col-lg-3 mb-4">
 <div class="card">
 <div class="card-body">
-<a href="https://scrapy.org/" target="_blank">
+<a href="https://github.com/scrapy/scrapy" target="_blank">
 <img class="card-img-top p-1" src="static/image/logos/scrapy.png" style="width:100%; height:auto;" alt="Scrapy">
 </a>
 <p class="card-text text-muted mt-3">A fast high-level web crawling & scraping framework</small></p>
@@ -3168,7 +3168,7 @@ <h2>
 <figure class="p-3 mb-0">
 <blockquote class="blockquote">
 <p>
-<a href="https://mechanicalsoup.readthedocs.io/" target="_blank">
+<a href="https://github.com/MechanicalSoup/MechanicalSoup" target="_blank">
 A Python library for automating interaction with websites.
 </a>
 </p>
@@ -3180,34 +3180,16 @@ <h2>
 <div class="col-12 col-6 col-sm-6 col-lg-3 mb-4">
 <div class="card">
 <div class="card-body text-end">
-<a href="https://selenium-python.readthedocs.io/" target="_blank">
+<a href="https://github.com/SeleniumHQ/selenium/tree/trunk/py" target="_blank">
 <img class="card-img-top p-1" src="static/image/logos/selenium.webp" style="width:70%; height:auto;" alt="selenium">
 </a>
 <h5 class="card-title" style="background-color:#156741;">
-<a href="https://selenium-python.readthedocs.io/" style="color:#fff;height:20px" target="_blank">
+<a href="https://github.com/SeleniumHQ/selenium/tree/trunk/py" style="color:#fff;height:20px" target="_blank">
 Selenium
 </a>
 </h5>
 <p class="card-text mt-3"><small>A browser automation framework and ecosystem.</small></p>
-<p class="card-text"><a href="https://www.selenium.dev/" target="_blank">Selenium</a> is an umbrella project for a range of tools and libraries that enable and support the automation of web browsers.</p>
-</div>
-</div>
-</div>
-
-<div class="col-12 col-6 col-sm-6 col-lg-3 mb-4">
-<div class="card">
-<div class="card-body text-center">
-<a href="https://github.com/howie6879/ruia" target="_blank">
-<img class="card-img-top p-1" src="static/image/logos/ruia.png" style="width:30%; height:auto;" alt="Ruia">
-</a>
-<h5 class="card-title">
-<a href="https://github.com/howie6879/ruia" target="_blank">
-Ruia
-</a>
-</h5>
-<p class="card-text text-muted mt-3">Async Python 3.6+ web scraping micro-framework based on asyncio.</small></p>
-<hr>
-<p class="card-text">Ruia is an async web scraping micro-framework, written with asyncio and aiohttp, aims to make crawling url as convenient as possible.</p>
+<p class="card-text"><a href="https://github.com/SeleniumHQ/selenium" target="_blank">Selenium</a> is an umbrella project for a range of tools and libraries that enable and support the automation of web browsers.</p>
 </div>
 </div>
 </div>
@@ -3229,20 +3211,6 @@ <h5 class="card-title">
 </div>
 </div>
 
-<div class="col-12 col-6 col-sm-6 col-lg-3 mb-4">
-<div class="card p-3">
-<a href="https://roniemartinez.github.io/dude/" target="_blank">
-<h2 style="background-color: #2094f3; color:#fff">Dude</h2>
-</a>
-<hr class="my-2">
-<figure class="p-2 mb-0">
-<blockquote class="blockquote">
-<p><i>dude</i> uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators</p>
-</blockquote>
-</figure>
-</div>
-</div>
-
 <div class="col-12 col-6 col-sm-6 col-lg-3 mb-4">
 <div class="card p-3">
 <a href="https://github.com/adbar/trafilatura" target="_blank">
@@ -3268,21 +3236,9 @@ <h2 style="background-color: #2094f3; color:#fff">Dude</h2>
 </div>
 </div>
 
-<div class="col-12 col-6 col-sm-6 col-lg-3 mb-4">
-<div class="card">
-<div class="card-body">
-<a href="https://github.com/Gerapy/Gerapy" target="_blank">
-<h3>Gerapy</h3>
-</a>
-<hr class="my-2">
-<p class="card-text mt-3"><small>Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Scrapyd-Client, Scrapyd-API, Django and Vue.js.</small></p>
-</div>
-</div>
-</div>
-
 <div class="col-12 col-6 col-sm-6 col-lg-3 mb-4">
 <div class="card p-3 text-center">
-<a href="https://crawlee.dev/python/" target="_blank">
+<a href="https://github.com/apify/crawlee-python" target="_blank">
 <img class="card-img-top" src="static/image/logos/crawlee.svg" alt="Crawlee">
 </a>
 <figure class="p-3 mb-0">
@@ -3299,6 +3255,18 @@ <h3>Gerapy</h3>
 </div>
 </div>
 
+<div class="col-12 col-6 col-sm-6 col-lg-3 mb-4">
+<div class="card">
+<div class="card-body">
+<a href="https://github.com/Gerapy/Gerapy" target="_blank">
+<h3>Gerapy</h3>
+</a>
+<hr class="my-2">
+<p class="card-text mt-3"><small>Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Scrapyd-Client, Scrapyd-API, Django and Vue.js.</small></p>
+</div>
+</div>
+</div>
+
 </div>
 </div>
 <!--End of Web Crawling and Scraping-->