AngleSharp 1.4.1-beta.506

AngleSharp

AngleSharp is a .NET library for parsing angle-bracket-based hypertexts such as HTML, SVG, and MathML. XML without validation is also supported, and CSS can be parsed as well. The included parser is built upon the official W3C specification. It produces a portable HTML5 DOM representation of the given source code and ensures compatibility with the results of evergreen browsers. Standard DOM features such as querySelector and querySelectorAll are available for tree traversal.

Migrating from AngleSharp 0.9 to AngleSharp 0.10 or later (incl. 1.0)? Look at our migration documentation.

Key Features

  • Portable (using .NET Standard 2.0)
  • Standards-conformant (works exactly as evergreen browsers do)
  • Great performance (outperforms similar parsers in most scenarios)
  • Extensible (extend with your own services)
  • Useful abstractions (type helpers, jQuery-like construction)
  • Fully functional DOM (all the lists, iterators, and events you know)
  • Form submission (easily log in everywhere)
  • Navigation (a BrowsingContext is like a browser tab; control it from .NET!)
  • LINQ enhanced (use LINQ with DOM elements, naturally without wrappers)

The advantage over similar libraries like HtmlAgilityPack is that the exposed DOM uses the official W3C-specified API, i.e., even things like querySelectorAll are available in AngleSharp. The parser also follows the HTML 5.1 specification, which defines error handling and element correction. The AngleSharp library focuses on standards compliance, interactivity, and extensibility. It therefore gives web developers working with C# all the possibilities they know from using the DOM in any modern browser.
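As a small illustration of the error handling and element correction mentioned above, the following sketch feeds deliberately sloppy markup to the parser. It assumes the HtmlParser class from the AngleSharp.Html.Parser namespace (AngleSharp 0.10 or later); the markup string is made up for the example.

```csharp
using System;
using AngleSharp.Html.Parser;

// Markup with unclosed <p> elements and no <html>, <head>, or <body> tags.
var source = "<p>first<p>second";

var parser = new HtmlParser();
var document = parser.ParseDocument(source);

// The HTML parsing rules close the first <p> implicitly when the second one
// starts, and the missing <html>, <head>, and <body> elements are created.
var paragraphs = document.QuerySelectorAll("p");

Console.WriteLine(paragraphs.Length);
Console.WriteLine(document.Body.InnerHtml);
```

The resulting tree should match what a browser builds for the same input, which is the point of following the specified error handling rather than an ad hoc recovery strategy.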

The performance of AngleSharp is quite close to the performance of browsers. Even very large pages can be processed within milliseconds. AngleSharp tries to minimize memory allocations and reuses elements internally to avoid unnecessary object creation.

Simple Demo

This simple example retrieves data from Wikipedia.

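using System.Linq;
using AngleSharp;

// Configure AngleSharp with the default loader so documents can be fetched.
var config = Configuration.Default.WithDefaultLoader();
var address = "https://en.wikipedia.org/wiki/List_of_The_Big_Bang_Theory_episodes";
var context = BrowsingContext.New(config);
// Load and parse the page asynchronously.
var document = await context.OpenAsync(address);
// Select the third cell of every episode row and read its text.
var cellSelector = "tr.vevent td:nth-child(3)";
var cells = document.QuerySelectorAll(cellSelector);
var titles = cells.Select(m => m.TextContent);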

Or the same with explicit types:

using System.Collections.Generic;
using System.Linq;
using AngleSharp;
using AngleSharp.Dom;

IConfiguration config = Configuration.Default.WithDefaultLoader();
string address = "https://en.wikipedia.org/wiki/List_of_The_Big_Bang_Theory_episodes";
IBrowsingContext context = BrowsingContext.New(config);
IDocument document = await context.OpenAsync(address);
string cellSelector = "tr.vevent td:nth-child(3)";
IHtmlCollection<IElement> cells = document.QuerySelectorAll(cellSelector);
IEnumerable<string> titles = cells.Select(m => m.TextContent);

The example shows how to:

  • Set up the configuration to support document loading
  • Asynchronously load the document in a new context using that configuration
  • Run a query to get all cells with the content of interest
  • Use LINQ on the result, as the whole DOM supports LINQ queries

Every collection in AngleSharp supports LINQ statements. AngleSharp also provides many useful extension methods for element collections that cannot be found in the official DOM.
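To make the LINQ support concrete without a network request, the sketch below parses an inline document and filters elements with standard LINQ operators. The markup and the class name blue are invented for the example; OpenAsync with a req.Content(...) callback is used here to supply the source as a string.

```csharp
using System;
using System.Linq;
using AngleSharp;

var context = BrowsingContext.New(Configuration.Default);

// Provide the markup directly instead of loading it from a URL.
var document = await context.OpenAsync(req => req.Content(
    "<ul><li class=blue>One</li><li>Two</li><li class=blue>Three</li></ul>"));

// CSS selector query, then plain LINQ over the resulting collection.
var viaSelector = document.QuerySelectorAll("li.blue")
    .Select(li => li.TextContent)
    .ToList();

// The same result using LINQ only, starting from all elements of the document.
var viaLinq = document.All
    .Where(m => m.LocalName == "li" && m.ClassList.Contains("blue"))
    .Select(m => m.TextContent)
    .ToList();

Console.WriteLine(string.Join(", ", viaSelector));
Console.WriteLine(string.Join(", ", viaLinq));
```

Both queries should select the same two list items. Which form reads better is mostly a matter of taste: selectors are compact, while LINQ predicates can use arbitrary C# logic.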

Supported Platforms

AngleSharp has been created as a .NET Standard 2.0 compatible library. This includes, but is not limited to:

  • .NET Core (2.0 and later)
  • .NET Framework (4.6.2 and later)
  • Xamarin.Android (7.0 and 8.0)
  • Xamarin.iOS (10.0 and 10.14)
  • Xamarin.Mac (3.0 and 3.8)
  • Mono (4.6 and 5.4)
  • UWP (10.0 and 10.0.16299)
  • Unity (2018.1)

Documentation

The documentation of AngleSharp is located in the docs folder. More examples, best-practices, and general information can be found there. The documentation also contains a list of frequently asked questions.

More information is also available by following some of the hyperlinks mentioned in the Wiki. In-depth articles will be published on CodeProject, with links being placed in the Wiki at GitHub.

Use-Cases

  • Parsing HTML (incl. fragments)
  • Parsing CSS (incl. selectors, declarations, ...)
  • Constructing HTML (e.g., view-engine)
  • Minifying CSS, HTML, ...
  • Querying document elements
  • Crawling information
  • Gathering statistics
  • Web automation
  • Tools with HTML / CSS / ... support
  • Connection to page analytics
  • HTML / DOM unit tests
  • Automated JavaScript interaction
  • Testing other concepts, e.g., script engines
  • ...
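Two of the use-cases above, constructing HTML and parsing fragments, can be sketched in a few lines. This is only an illustration under assumptions: the element names, attribute values, and markup are made up, and the empty document is created with OpenNewAsync.

```csharp
using System;
using AngleSharp;
using AngleSharp.Html.Parser;

var context = BrowsingContext.New(Configuration.Default);

// Constructing HTML: start from an empty document and build nodes via the DOM.
var document = await context.OpenNewAsync();
var paragraph = document.CreateElement("p");
paragraph.SetAttribute("class", "greeting");
paragraph.TextContent = "Hello";
document.Body.AppendChild(paragraph);

// Parsing a fragment: the markup is parsed in the context of an existing element.
var parser = new HtmlParser();
var nodes = parser.ParseFragment("<li>One</li><li>Two</li>", document.Body);

Console.WriteLine(document.Body.InnerHtml);
Console.WriteLine(nodes.Length);
```

The fragment nodes are returned as a list and are not attached to the document automatically; appending them to a parent element is left to the caller.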

Vision

The project aims to bring a solid implementation of the W3C DOM for HTML, SVG, MathML, and CSS to the CLR - all written in C#. The idea is that you can basically do everything with the DOM in C# that you can do in JavaScript (plus, of course, more).

Most parts of the DOM are included, even though some may still lack a fully specified or correct implementation. The goal for v1.0 is to have all practically relevant parts implemented according to the official W3C specification (with useful extensions by the WHATWG).

The API is close to the DOM4 specification; however, the naming has been adjusted to comply with .NET conventions. Nevertheless, to make AngleSharp really useful for, e.g., a JavaScript engine, attributes have been placed on the corresponding interfaces (and methods, properties, ...) to indicate the status of the field in the official specification. This allows automatic generation of DOM objects with the official API.
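A rough sketch of how such attributes could be read via reflection follows. It assumes that the attribute in question is DomNameAttribute in the AngleSharp.Attributes namespace, that it exposes an OfficialName property, and that QuerySelectorAll is declared on IParentNode; verify these names against the version you use.

```csharp
using System;
using System.Linq;
using System.Reflection;
using AngleSharp.Attributes;
using AngleSharp.Dom;

// Look up the .NET method and read the official DOM name(s) attached to it.
// Assumption: DomNameAttribute.OfficialName carries the name from the spec.
var method = typeof(IParentNode).GetMethod("QuerySelectorAll");
var officialNames = method
    .GetCustomAttributes<DomNameAttribute>()
    .Select(attribute => attribute.OfficialName)
    .ToList();

Console.WriteLine($"{method.Name} -> {string.Join(", ", officialNames)}");
```

A script engine binding could use the same lookup to expose each .NET member under its official DOM name.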

This is a long-term project which will eventually result in a state-of-the-art parser for the most important angle-bracket-based hypertexts.

Our hope is to build a community around web parsing and libraries from this project. So far we have had great contributions, but that goal has not been fully achieved. Want to help? Get in touch with us!

Participating in the Project

If you know of a feature that AngleSharp is currently missing, and you are willing to implement it, then your contribution is more than welcome! Also, if you have a really cool idea, do not be shy; we'd like to hear it.

If you have an idea how to improve the API (or what is missing), then posts / messages are also welcome. For instance, there have been ongoing discussions in the past about some naming styles used by AngleSharp (e.g., HTMLDocument or HtmlDocument). In the end, AngleSharp stopped using HTMLDocument (at least visibly outside of the library). Now AngleSharp uses names like IDocument, IHtmlElement, and so on. This change would not have been possible without such fruitful discussions.

The project is always searching for additional contributors. Even if you do not have any code to contribute, an idea for improvement, a bug report, or a note about a mistake in the documentation is welcome. These are the contributions that keep this project active.

Live discussions can take place in our Gitter chat, which supports using GitHub accounts.

More information is found in the contribution guidelines. All contributors can be found in the CONTRIBUTORS file.

This project has also adopted the code of conduct defined by the Contributor Covenant to clarify expected behavior in our community.

For more information see the .NET Foundation Code of Conduct.

Funding / Support

If you use AngleSharp frequently but do not have the time to support the project through active participation, you may still be interested in ensuring that the AngleSharp project keeps the lights on.

Therefore we created a backing model via Bountysource. Any donation is welcome and much appreciated. We will mostly spend the money on dedicated development time to improve AngleSharp where it needs to be improved, plus invest in the web utility eco-system in .NET (e.g., in JavaScript engines, other parsers, or a renderer for AngleSharp to mention some outstanding projects).

Visit Bountysource for more details.

Development

AngleSharp is written in the most recent version of C# and thus requires Roslyn as a compiler. Using an IDE like Visual Studio 2019+ is recommended on Windows. Alternatively, VSCode (with OmniSharp or another suitable Language Server Protocol implementation) should be the tool of choice on other platforms.

The code tries to be as clean as possible. Notably the following rules are used:

  • Use braces for any conditional / loop body
  • Use the -Async suffixed methods when available
  • Use VIP ("Var If Possible") style (in C++ called AAA: Almost Always Auto) to place types on the right
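The three rules above might look as follows in practice. This is a hypothetical helper written for illustration only, not code from the AngleSharp repository; GetTitleAsync and the sample markup are invented names.

```csharp
using System;
using System.Threading.Tasks;
using AngleSharp;

// Prefer the -Async suffixed method (OpenAsync) and await it.
static async Task<string> GetTitleAsync(IBrowsingContext context, string markup)
{
    // VIP style: "var" on the left, the type is evident from the right-hand side.
    var document = await context.OpenAsync(req => req.Content(markup));
    var title = document.Title;

    // Braces even for a single-statement conditional body.
    if (string.IsNullOrEmpty(title))
    {
        return "(untitled)";
    }

    return title;
}

var context = BrowsingContext.New(Configuration.Default);
Console.WriteLine(await GetTitleAsync(context, "<title>Demo</title><p>Body"));
```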

More important, however, is the proper usage of tests. Any new feature should come with a set of tests to cover the functionality and prevent regression.

Changelog

A very detailed changelog exists. If you are just interested in major releases then have a look at the GitHub releases.

.NET Foundation

This project is supported by the .NET Foundation.

License

AngleSharp is released using the MIT license. For more information see the license file.

Showing the top 20 packages that depend on AngleSharp.

Package | Description | Downloads
AngleSharp.Css | Extends the CSSOM from the core AngleSharp library. | 15
AngleSharp.Css | Extends the CSSOM from the core AngleSharp library. | 16
AngleSharp.Css | Extends the CSSOM from the core AngleSharp library. | 17
AngleSharp.Css | Extends the CSSOM from the core AngleSharp library. | 18
AngleSharp.Diffing | Provides a complete diffing model of HTML. | 15
AngleSharp.Wrappers | A library of wrappers for AngleSharp. Allows you to replace a real AngleSharp DOM tree but still keep the queried/returned node references. Built for and used by https://github.com/egil/razor-components-testing-library. | 15

.NET Standard 2.0

.NET Framework 4.6.2

.NET Framework 4.7.2

.NET 6.0

  • No dependencies.

.NET 7.0

  • No dependencies.

.NET 8.0

  • No dependencies.

.NET 10.0

  • No dependencies.

Version Downloads Last updated
1.4.1-beta.506 1 17.12.2025
1.4.1-beta.505 1 17.12.2025
1.4.1-beta.504 1 17.12.2025
1.4.1-beta.502 2 16.11.2025
1.4.0 2 14.11.2025
1.4.0-beta.499 1 16.11.2025
1.4.0-beta.497 1 16.11.2025
1.4.0-beta.496 1 16.11.2025
1.4.0-beta.495 4 16.11.2025
1.4.0-beta.493 4 16.11.2025
1.3.1 3 09.11.2025
1.3.1-beta.491 11 03.08.2025
1.3.1-beta.490 6 03.08.2025
1.3.1-beta.486 12 28.05.2025
1.3.0 12 23.05.2025
1.3.0-beta.484 13 24.05.2025
1.3.0-beta.477 12 23.05.2025
1.3.0-beta.476 12 23.05.2025
1.3.0-beta.470 12 24.05.2025
1.3.0-beta.468 12 14.03.2025
1.3.0-beta.466 11 14.03.2025
1.2.0 12 14.01.2025
1.2.0-beta.457 12 14.03.2025
1.2.0-beta.456 15 14.03.2025
1.2.0-beta.449 16 28.01.2025
1.2.0-beta.448 11 18.02.2025
1.2.0-beta.439 14 29.12.2024
1.2.0-beta.431 11 27.01.2025
1.2.0-beta.423 14 16.01.2025
1.2.0-beta.420 15 28.01.2025
1.2.0-beta.419 12 28.01.2025
1.2.0-beta.418 11 28.01.2025
1.2.0-beta.410 11 28.01.2025
1.2.0-beta.408 10 01.01.2025
1.1.2 19 13.05.2024
1.1.2-beta.407 13 31.12.2024
1.1.2-beta.395 14 31.12.2024
1.1.1 14 23.01.2025
1.1.1-beta.392 12 15.01.2025
1.1.1-beta.390 13 15.01.2025
1.1.1-beta.389 14 31.12.2024
1.1.1-beta.388 15 16.01.2025
1.1.1-beta.387 15 24.01.2025
1.1.1-beta.386 11 16.01.2025
1.1.1-beta.385 11 23.01.2025
1.1.0 15 16.01.2025
1.1.0-beta.384 14 23.01.2025
1.1.0-alpha-379 10 14.03.2025
1.1.0-alpha-378 14 24.01.2025
1.1.0-alpha-377 12 23.01.2025
1.1.0-alpha-376 11 24.01.2025
1.1.0-alpha-375 14 24.01.2025
1.1.0-alpha-374 10 23.01.2025
1.0.7 13 27.01.2025
1.0.7-alpha-342 16 23.01.2025
1.0.6 13 24.01.2025
1.0.6-alpha-341 10 20.02.2025
1.0.6-alpha-339 11 23.01.2025
1.0.6-alpha-331 14 23.01.2025
1.0.6-alpha-330 13 18.02.2025
1.0.6-alpha-328 17 24.01.2025
1.0.6-alpha-325 17 24.01.2025
1.0.6-alpha-321 15 23.01.2025
1.0.5 11 24.01.2025
1.0.5-alpha-317 15 24.01.2025
1.0.4 12 16.01.2025
1.0.4-alpha-316 14 24.01.2025
1.0.4-alpha-314 15 18.02.2025
1.0.4-alpha-311 17 24.01.2025
1.0.4-alpha-307 13 24.01.2025
1.0.4-alpha-301 17 16.01.2025
1.0.4-alpha-300 10 23.01.2025
1.0.4-alpha-298 16 15.01.2025
1.0.4-alpha-290 11 24.01.2025
1.0.4-alpha-289 10 24.01.2025
1.0.3 16 23.01.2025
1.0.3-alpha-287 14 23.01.2025
1.0.2 13 28.01.2025
1.0.2-alpha-284 13 24.01.2025
1.0.2-alpha-283 16 23.01.2025
1.0.2-alpha-282 13 23.01.2025
1.0.2-alpha-281 15 24.01.2025
1.0.2-alpha-278 13 24.01.2025
1.0.2-alpha-277 11 24.01.2025
1.0.2-alpha-276 15 23.01.2025
1.0.2-alpha-275 10 23.01.2025
1.0.2-alpha-274 14 24.01.2025
1.0.2-alpha-273 14 24.01.2025
1.0.2-alpha-261 13 15.01.2025
1.0.2-alpha-258 14 16.01.2025
1.0.2-alpha-257 11 15.01.2025
1.0.2-alpha-255 15 15.01.2025
1.0.2-alpha-251 14 23.01.2025
1.0.2-alpha-250 11 24.01.2025
1.0.2-alpha-249 7 14.03.2025
1.0.1 11 23.01.2025
1.0.1-alpha-248 14 16.01.2025
1.0.1-alpha-243 12 17.02.2025
1.0.1-alpha-242 13 15.01.2025
1.0.1-alpha-241 17 23.01.2025
1.0.1-alpha-235 14 15.01.2025
1.0.0 10 16.01.2025
1.0.0-ci-228 16 16.01.2025
1.0.0-alpha-231 10 16.01.2025
1.0.0-alpha-229 16 02.01.2025
0.17.1 12 16.01.2025
0.17.1-alpha-179 16 23.01.2025
0.17.1-alpha-178 14 16.01.2025
0.17.0 13 16.01.2025
0.17.0-alpha-177 10 16.01.2025
0.17.0-alpha-174 10 16.01.2025
0.17.0-alpha-173 13 19.02.2025
0.17.0-alpha-172 14 18.02.2025
0.17.0-alpha-171 13 16.01.2025
0.17.0-alpha-170 10 23.01.2025
0.17.0-alpha-169 12 24.01.2025
0.16.1 11 18.02.2025
0.16.1-alpha-99 13 24.01.2025
0.16.1-alpha-96 9 16.01.2025
0.16.1-alpha-91 12 23.01.2025
0.16.1-alpha-168 12 24.01.2025
0.16.1-alpha-167 11 24.01.2025
0.16.1-alpha-155 12 28.01.2025
0.16.1-alpha-153 15 28.01.2025
0.16.1-alpha-152 12 24.01.2025
0.16.1-alpha-148 14 24.01.2025
0.16.1-alpha-145 13 16.01.2025
0.16.1-alpha-144 12 23.01.2025
0.16.1-alpha-133 12 24.01.2025
0.16.1-alpha-127 12 24.01.2025
0.16.1-alpha-125 11 23.01.2025
0.16.1-alpha-120 15 16.01.2025
0.16.1-alpha-114 10 16.01.2025
0.16.1-alpha-112 12 28.01.2025
0.16.1-alpha-110 16 13.02.2025
0.16.1-alpha-108 14 23.01.2025
0.16.1-alpha-106 13 24.01.2025
0.16.1-alpha-104 15 28.01.2025
0.16.0 11 28.01.2025
0.16.0-alpha-86 10 16.01.2025
0.16.0-alpha-85 13 16.01.2025
0.16.0-alpha-84 14 07.01.2025
0.16.0-alpha-80 15 02.01.2025
0.16.0-alpha-79 10 16.01.2025
0.16.0-alpha-78 14 15.01.2025
0.16.0-alpha-77 11 28.01.2025
0.16.0-alpha-76 11 17.01.2025
0.16.0-alpha-75 11 31.12.2024
0.16.0-alpha-72 10 15.01.2025
0.15.0 15 31.12.2024
0.15.0-alpha-14 11 31.12.2024
0.14.0 11 16.01.2025
0.14.0-alpha-818 15 31.12.2024
0.14.0-alpha-817 12 31.12.2024
0.14.0-alpha-813 12 16.01.2025
0.14.0-alpha-811 15 23.01.2025
0.14.0-alpha-809 12 31.12.2024
0.14.0-alpha-805 10 01.01.2025
0.14.0-alpha-803 18 01.01.2025
0.14.0-alpha-802 10 31.12.2024
0.14.0-alpha-801 11 31.12.2024
0.14.0-alpha-798 12 16.01.2025
0.14.0-alpha-796 9 16.01.2025
0.14.0-alpha-794 11 19.02.2025
0.14.0-alpha-793 10 18.02.2025
0.14.0-alpha-790 17 23.01.2025
0.14.0-alpha-789 13 31.12.2024
0.14.0-alpha-788 10 31.12.2024
0.14.0-alpha-787 10 16.01.2025
0.14.0-alpha-784 11 03.02.2025
0.14.0-alpha-783 9 16.01.2025
0.13.0 14 16.01.2025
0.13.0-alpha-782 12 16.01.2025
0.13.0-alpha-775 11 15.01.2025
0.13.0-alpha-771 10 28.01.2025
0.13.0-alpha-768 14 08.01.2025
0.13.0-alpha-766 12 27.01.2025
0.13.0-alpha-764 11 31.12.2024
0.13.0-alpha-763 13 01.01.2025
0.13.0-alpha-760 10 16.01.2025
0.13.0-alpha-758 11 28.01.2025
0.13.0-alpha-756 14 06.01.2025
0.13.0-alpha-754 12 31.12.2024
0.13.0-alpha-748 16 24.01.2025
0.13.0-alpha-745 11 01.01.2025
0.13.0-alpha-744 9 16.01.2025
0.13.0-alpha-743 11 23.01.2025
0.13.0-alpha-742 12 28.01.2025
0.13.0-alpha-739 10 16.01.2025
0.13.0-alpha-737 12 16.01.2025
0.13.0-alpha-735 12 24.01.2025
0.13.0-alpha-734 12 28.01.2025
0.13.0-alpha-733 10 15.01.2025
0.12.1 12 31.12.2024
0.12.0 15 31.12.2024
0.11.0 14 16.01.2025
0.10.1 10 16.01.2025
0.10.0 11 18.02.2025
0.9.11 11 16.01.2025
0.9.10 11 24.01.2025
0.9.9.2 10 28.01.2025
0.9.9.1 11 28.01.2025
0.9.9 13 16.01.2025
0.9.8.1 18 31.12.2024
0.9.8 12 18.02.2025
0.9.7 12 27.01.2025
0.9.6 12 16.01.2025
0.9.5 10 31.12.2024
0.9.4 10 12.02.2025
0.9.3 10 16.01.2025
0.9.2 20 13.01.2025
0.9.1 14 31.12.2024
0.9.0 13 01.01.2025
0.8.9 13 16.01.2025
0.8.8 14 01.01.2025
0.8.7.1 14 01.01.2025
0.8.7 16 31.12.2024
0.8.6 18 16.01.2025
0.8.5 17 31.12.2024
0.8.4.1 13 31.12.2024
0.8.4 15 01.01.2025
0.8.3 0 21.04.2015
0.8.2 0 15.04.2015
0.8.1 13 31.12.2024
0.8.0 11 16.01.2025
0.7.0 11 31.12.2024
0.6.1 15 05.02.2025
0.6.0 13 31.12.2024
0.5.1 14 01.01.2025
0.5.0 14 16.01.2025
0.4.0 14 16.01.2025
0.3.7 14 02.01.2025
0.3.6 15 01.01.2025
0.3.5 15 16.01.2025
0.3.4 11 01.01.2025
0.3.3 18 01.01.2025
0.3.2 14 16.01.2025
0.3.1 13 31.12.2024
0.3.0 15 16.01.2025
0.2.9 11 01.01.2025
0.2.8 11 24.01.2025
0.2.7 14 02.01.2025
0.2.6 13 01.01.2025
0.2.5 15 16.01.2025
0.2.4 11 16.01.2025
0.2.3 13 14.03.2025
0.2.2 16 31.12.2024
0.2.1 11 29.01.2025
0.2.0 14 31.12.2024