Features List – Aspose.HTML for .NET
Aspose.HTML for .NET is a headless browser written in C#. It lets you create, open, manipulate, navigate and convert HTML, XHTML, SVG, EPUB, MHTML and Markdown documents, with output formats that include PDF, XPS, Markdown, JPG and PNG. All core modules are implemented according to the official HTML, CSS and JavaScript specifications, which leads to high-quality parsing, conversion and manipulation results.
Aspose.HTML for .NET provides a rich set of features across several areas:
Document Manipulation
- Create a new (X)HTML or SVG document or open an existing one. With Aspose.HTML for .NET you can open a document not only from a local file but also directly from the web.
- Save the document along with all referenced resources, such as CSS and images. Aspose.HTML for .NET offers a highly customizable saver that can save not only the document and its referenced resources but also other documents it references.
- Create, edit, remove and replace HTML nodes via a rich API based on the Document Object Model (DOM) as defined in the official specification.
- Customize the environment extensively. You can specify a user style sheet, a font folder, an external resource handler and much more.
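The manipulation workflow described above can be sketched in a few lines of C#. This is a minimal illustration, assuming the Aspose.HTML for .NET package is referenced; the inline HTML content, the class name and the output file name `output.html` are placeholders chosen for this example.

```csharp
using Aspose.Html;

class DocumentManipulationSketch
{
    static void Main()
    {
        // Create a document from an inline HTML string; "." is the base URI
        // used to resolve relative resource references.
        using (var document = new HTMLDocument("<p>First paragraph</p>", "."))
        {
            // Create a new element and set its text through the DOM API.
            var paragraph = document.CreateElement("p");
            paragraph.TextContent = "Second paragraph, added via the DOM.";

            // Attach the new element to the end of the document body.
            document.Body.AppendChild(paragraph);

            // Save the edited document (placeholder file name).
            document.Save("output.html");
        }
    }
}
```

To open a document from a local file or from the web instead, the single-argument `HTMLDocument` constructor can be given a file path or a URL.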
Data Extraction
Although Aspose.HTML for .NET is not a web scraper by itself, it can be used to collect information from an opened document:
- Navigate through the HTML elements with the help of the DOM API. Aspose.HTML for .NET implements the Traversal interfaces, which allow you to easily navigate through the DOM tree.
- Collect information from HTML documents using high-performance XPath queries. Aspose.HTML for .NET supports the XPath 1.0 interfaces defined in the official specification.
- Collect HTML elements by executing CSS Selector queries. Aspose.HTML for .NET implements the CSS Selectors API according to the latest specification.
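As a rough sketch of the two query styles listed above, the snippet below runs a CSS selector query and an equivalent XPath 1.0 query against the same in-memory document. The sample markup, the class name and the query strings are invented for illustration; it assumes the Aspose.HTML for .NET package is referenced.

```csharp
using System;
using Aspose.Html;
using Aspose.Html.Dom;
using Aspose.Html.Dom.XPath;

class DataExtractionSketch
{
    static void Main()
    {
        // Sample markup used only for this illustration.
        var html = "<div class='item'><span>Alpha</span></div>" +
                   "<div class='item'><span>Beta</span></div>";

        using (var document = new HTMLDocument(html, "."))
        {
            // CSS selector query: every <span> that is a direct child of div.item.
            var matches = document.QuerySelectorAll("div.item > span");
            foreach (HTMLElement element in matches)
            {
                Console.WriteLine(element.TextContent);
            }

            // XPath 1.0 query selecting the same elements.
            var result = document.Evaluate("//div[@class='item']/span",
                                           document, null, XPathResultType.Any, null);

            // Iterate over the node set until IterateNext() returns null.
            for (Node node; (node = result.IterateNext()) != null;)
            {
                Console.WriteLine(node.TextContent);
            }
        }
    }
}
```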
Conversion
One of the main goals of Aspose.HTML for .NET is to provide simple, highly customizable and precise converters.
Simplicity: The .NET API lets you convert HTML, XHTML, SVG, EPUB and MHTML to many different formats with just one line of code. Here is an example of HTML to PDF conversion:
Aspose.Html.Converters.Converter.ConvertHTML(@"<span>Hello, World!!</span>", ".", new Aspose.Html.Saving.PdfSaveOptions(), "output.pdf");
Customizability: With Aspose.HTML for .NET you can fine-tune many aspects of the conversion process. Many other products let you set up the page size or image resolution; here you can also customize the processing of external resources, specify a custom style sheet for your document, and much more.
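One way the custom style sheet option might be used is sketched below: a user style sheet is registered on a `Configuration` object, and the document created with that configuration is then converted to PDF. The CSS rule, the class name and the output file name are illustrative assumptions, and the sketch assumes the Aspose.HTML for .NET package is referenced.

```csharp
using Aspose.Html;
using Aspose.Html.Converters;
using Aspose.Html.Saving;
using Aspose.Html.Services;

class CustomStyleSheetSketch
{
    static void Main()
    {
        using (var configuration = new Configuration())
        {
            // Register a user style sheet that applies to every document
            // created with this configuration (illustrative rule).
            var userAgent = configuration.GetService<IUserAgentService>();
            userAgent.UserStyleSheet = "span { color: green; font-size: 24pt; }";

            // Create the document with the customized configuration.
            using (var document = new HTMLDocument("<span>Hello, World!!</span>", ".", configuration))
            {
                // Convert to PDF with default save options (placeholder file name).
                Converter.ConvertHTML(document, new PdfSaveOptions(), "output.pdf");
            }
        }
    }
}
```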
Precision: All components of the Aspose.HTML for .NET assembly are designed according to the official HTML and SVG specifications, which leads to high-precision conversion results.
Aspose.HTML for .NET implements many converters, which can be split into three groups:
- Fixed layout converters. These converters are responsible for conversion of HTML, XHTML, SVG, EPUB and MHTML to PDF, XPS and raster image formats (PNG, JPG, BMP).
- Markdown converters. These converters are responsible for conversion of Markdown to HTML and HTML to Markdown.
- Template converter. This converter is used to create an HTML file from a template.
You can explore all the available converters on this page.
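To give a feel for the Markdown converters, here is a hedged sketch that converts a small Markdown file to HTML and a small HTML file back to Markdown. All file names and the sample content are placeholders created by the example itself; it assumes the Aspose.HTML for .NET package is referenced.

```csharp
using System.IO;
using Aspose.Html.Converters;
using Aspose.Html.Saving;

class MarkdownConversionSketch
{
    static void Main()
    {
        // Prepare a small Markdown source file (sample content).
        File.WriteAllText("document.md", "# Title\n\nSome *emphasized* text.");

        // Markdown to HTML.
        Converter.ConvertMarkdown("document.md", "from-markdown.html");

        // Prepare a small HTML source file (sample content).
        File.WriteAllText("document.html", "<h1>Title</h1><p>Some text.</p>");

        // HTML to Markdown, using default Markdown save options.
        Converter.ConvertHTML("document.html", new MarkdownSaveOptions(), "from-html.md");
    }
}
```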
Built-in Formats
A real-world HTML page is a complex set of formats working together, which is why developing a high-quality converter is such a complex task. Aspose.HTML for .NET includes implementations of all the HTML-related formats:
- The parser works according to the latest HTML5 specification. This means that Aspose.HTML for .NET builds the DOM tree the way the specification prescribes.
- The JavaScript processing engine supports the ECMAScript 5.1 specification, which means that the majority of scripts will be processed during document parsing or at runtime. You can also specify a JavaScript processing timeout or completely disable script execution.
- CSS3 is supported out of the box. Aspose.HTML for .NET provides an advanced CSS manipulation API that lets you manipulate style sheets either for the whole document or for a particular node.
- HTML Canvas is also supported by Aspose.HTML for .NET. You can draw on it using JavaScript or the HTML Canvas API.
- SVG parsing and rendering is based on the latest SVG 2.0 specification, which allows you to use modern elements such as filters.
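The script controls mentioned above (a processing timeout and the option to disable scripts) can be sketched as follows. The runaway script, the class name and the five-second limit are illustrative choices, and the snippet assumes the Aspose.HTML for .NET package is referenced.

```csharp
using System;
using Aspose.Html;
using Aspose.Html.Services;

class ScriptControlSketch
{
    static void Main()
    {
        // Sample markup containing a script that never finishes on its own.
        var html = "<script>while (true) {}</script><p>Content after a runaway script</p>";

        using (var configuration = new Configuration())
        {
            // Limit JavaScript processing time to five seconds (illustrative value).
            var runtime = configuration.GetService<IRuntimeService>();
            runtime.JavaScriptTimeout = TimeSpan.FromSeconds(5);

            // Alternatively, script execution can be switched off entirely:
            // configuration.Security |= Sandbox.Scripts;

            using (var document = new HTMLDocument(html, ".", configuration))
            {
                // The document is still available after script processing is cut off.
                Console.WriteLine(document.Body.TextContent);
            }
        }
    }
}
```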
Platform Independence
- You can use Aspose.HTML for .NET to build any type of 32-bit or 64-bit .NET application, including ASP.NET, WCF, WinForms, .NET Core, etc.
- Aspose.HTML for .NET runs on both Windows and Linux operating systems.
- Full support for all .NET Framework versions starting from 2.0, including Client Profile versions.
- Aspose.HTML for .NET provides a .NET Standard 2.0 assembly, which can be used with frameworks that implement it, such as .NET Core 2.0.
Helpful Extensions
- Form Editor – a utility class that allows you to easily edit and submit HTML forms. You can find usage examples and a detailed description on this page.
- Vendor Specific CSS – Aspose.HTML for .NET provides its own CSS properties, which extend the existing ones. For example, they can be used to add page numbers.