ExcavatorSharp is a multi-threaded server for scraping web data. It converts HTML code into a structured array of data. The library allows data scraping from multiple sites in parallel mode, within a single running application. Create scraping tasks and perform data extraction on a schedule.
The library is designed for professional extraction and parsing of large volumes of data. Under the hood there are .css-selectors and xpath support, data export into .csv/.xlsx/.sql/.json, online data export, support for proxy servers, dynamic content crawling, interaction with the site via javascript and much more. The library uses .NET Sockets and Chromium Embedded Framework.
The library can be used separately as crawler or parser. We support the formats sitemap.xml and robots.txt. We support the gzip / deflate compression.
Attention! Only x64 versions are supported for .NET 4.5.2 and 4.6 platforms. AnyCPU build does not support! You will NOT be able to run the library when building AnyCPU. This is caused by the features of CEF.
The library is designed for professional extraction and parsing of large volumes of data. Under the hood there are .css-selectors and xpath support, data export into .csv/.xlsx/.sql/.json, online data export, support for proxy servers, dynamic content crawling, interaction with the site via javascript and much more. The library uses .NET Sockets and Chromium Embedded Framework.
The library can be used separately as crawler or parser. We support the formats sitemap.xml and robots.txt. We support the gzip / deflate compression.
Attention! Only x64 versions are supported for .NET 4.5.2 and 4.6 platforms. AnyCPU build does not support! You will NOT be able to run the library when building AnyCPU. This is caused by the features of CEF.
For projects that support PackageReference, copy this XML node into the project file to reference the package.
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
Multiply 4 3 2
Release Notes
Why Choose Webscraperapp? We Are With You Every Step Of The Way! Whether you are a seasoned Seller, or just starting out with your first Inventory and Order tool, our professional staff is here to assist you every step of the way. Aug 08, 2018 WebScraper 4.3.2 – Scan and output website data as CSV or JSON. August 8, 2018 WebScraper uses the Integrity v6 engine to quickly scan a website, and can output the data (currently) as CSV or JSON.
1) Added ability to extract data from iframe blocks
2) Added possibility to take a screenshot in the project testing mode
3) Fixed current errors and increased productivity
2) Added possibility to take a screenshot in the project testing mode
3) Fixed current errors and increased productivity
Dependencies
- cef.redist.x64(>= 79.1.36)
- cef.redist.x86(>= 79.1.36)
- CefSharp.Common(>= 75.1.360)
- CefSharp.OffScreen(>= 75.1.360)
- EPPlus(<= 4.5.3.3)
- HtmlAgilityPack(>= 1.11.23)
- HtmlAgilityPack.CssSelectors(>= 1.0.2)
- Newtonsoft.Json(>= 12.0.3)
- RestSharp(>= 106.10.1)
Used By
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
Web Scraper 4 3 20
This package is not used by any popular GitHub repositories.
Version History
Version | Downloads | Last updated |
---|---|---|
1.2.8 | 107 | 8/10/2020 |
1.2.7 | 81 | 8/10/2020 |
1.2.3 | 113 | 5/20/2020 |
1.2.2 | 66 | 5/10/2020 |
1.2.1 | 91 | 5/5/2020 |
1.2.0 | 87 | 4/30/2020 |
1.1.0 | 91 | 4/23/2020 |
1.0.53 | 103 | 4/12/2020 |
1.0.52 | 106 | 4/11/2020 |
1.0.51 | 107 | 4/11/2020 |
1.0.6 | 81 | 4/23/2020 |
1.0.5 | 110 | 4/11/2020 |
1.0.4 | 148 | 4/3/2020 |
1.0.3 | 94 | 2/12/2020 |
1.0.2 | 167 | 1/30/2020 |
1.0.1 | 107 | 1/30/2020 |
1.0.0 | 87 | 1/23/2020 |
![Technique Technique](https://image.winudf.com/v2/image1/Y29tLmN0aW9uLnBsYXllcmdhbWVzX3NjcmVlbl82XzE1NTMyMTQ0MjlfMDE5/screen-6.jpg?fakeurl=1&type=.jpg)
Home » Mac » Developer Tools » WebScraper
Start Download Now |
---|
webscraper.dmg | 4.01 MB |
Price | Free to try |
Version | 1.2 |
Category | Developer Tools |
Operating Systems | OS X 10.8, OS X 10.9, OS X 10.10, OS X 10.11, macOS 10.12 |
Publisher | Peacockmedia http://peacockmedia.software |
Publisher's Description | |
WebScraper uses the Integrity v6 Engine to quickly scan a website, and can output the data (currently) as csv or json. The output can include various meta data, the entire content of each page (as text, html or markdown) and divs or spans extracted by class or id. Flux 5 6 4 – advanced web design tool. Webscraper is new. Please use it for free and please get in touch with any requests, bug reports or observations.
|