# code4craft/webmagic

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/code4craft-webmagic).**

11,680 stars · 4,126 forks · Java · Apache-2.0

## Links

- GitHub: https://github.com/code4craft/webmagic
- Homepage: http://webmagic.io/
- awesome-repositories: https://awesome-repositories.com/repository/code4craft-webmagic.md

## Topics

`crawler` `framework` `java` `scraping`

## Description

A scalable web crawler framework for Java.

## Tags

### Part of an Awesome List

- [Web Crawling](https://awesome-repositories.com/f/awesome-lists/devtools/web-crawling.md) — Scalable crawler with downloading and content extraction.
