Scrapfly SDK

Installation

pip install scrapfly-sdk

You can also install extra dependencies

pip install "scrapfly-sdk[seepdup]" for performance improvement
pip install "scrapfly-sdk[concurrency]" for concurrency out of the box (asyncio / thread)
pip install "scrapfly-sdk[scrapy]" for scrapy integration
pip install "scrapfly-sdk[scrapy]" Everything!

Get Your API Key

You can create a free account on Scrapfly to get your API Key.

Migration

Migrate from 0.7.x to 0.8

asyncio-pool dependency has been dropped

scrapfly.concurrent_scrape is now an async generator. If the concurrency is None or not defined, the max concurrency allowed by your current subscription is used.

    async for result in scrapfly.concurrent_scrape(concurrency=10, scrape_configs=[ScrapConfig(...), ...]):
        print(result)

brotli args is deprecated and will be removed in the next minor. There is not benefit in most of case versus gzip regarding and size and use more CPU.

What's new

0.8.x

Better error log
Async/Improvement for concurrent scrape with asyncio
Scrapy media pipeline are now supported out of the box

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
docs		docs
examples		examples
scrapfly		scrapfly
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrapfly SDK

Installation

Get Your API Key

Migration

Migrate from 0.7.x to 0.8

What's new

0.8.x

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Scrapfly SDK

Installation

Get Your API Key

Migration

Migrate from 0.7.x to 0.8

What's new

0.8.x

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages