Aspose.Words for Python via .NET 21.11 Release Notes

This page contains release notes for Aspose.Words for Python via .NET 21.11.

Major Features

There are 64 improvements and fixes in this release. The most notable are:

New UseAttachments option has been implemented to display the attachments panel in the output PDF documents.
Added a new ChmLoadOptions to get more control over CHM loading.
Added new option to ignore field codes during find-replace operation.
Added ability to save images as Base64 string to Markdown format.

Full List of Issues Covering all Changes in this Release (Reported by .NET Users)

Key	Summary	Category
WORDSNET-1697	Add possibility to use complex Fill Effects for shapes (gradient, texture etc)	New Feature
WORDSNET-8696	Add new property of PdfSaveOptions to export SDT controls as editable PDF form fields	New Feature
WORDSNET-9945	Support native gradients with transparency in PDF	New Feature
WORDSNET-11453	Add feature to view/hide navigation panes of PDF	New Feature
WORDSNET-14472	Consider adding a Callback event that notifies the progress of loading document	New Feature
WORDSNET-14510	SDT controls are not editable in the produced PDF	New Feature
WORDSNET-14850	Export drop-down list content controls as ‘select’ HTML elements	New Feature
WORDSNET-17181	Block level SDT not converting to Editable AcroFrom field	New Feature
WORDSNET-18648	Text gradient support	New Feature
WORDSNET-22513	Support “ms-its” and other local URLs during import CHM from a stream	New Feature
WORDSNET-22787	Introduce ExportImagesAsBase64 option for MD format	New Feature
WORDSNET-22794	Implement MS Word scaling algorithm for non-native Ink objects when rendering	New Feature
WORDSNET-22803	Provide full support of ReportingEngine.UseReflectionOptimization for .NET Standard	New Feature
WORDSNET-22897	Check Aspose.Words for .NET Standard works with .NET 6.0	New Feature
WORDSNET-8920	SDT controls placed outside paragraph level are not preserved when converting to PDF	Enhancement
WORDSNET-11542	Provide ExportDropDownFormFieldAsText property for StructuredDocumentTag	Enhancement
WORDSNET-14318	Provide option to export SDT checkboxes as <input> in HTML	Enhancement
WORDSNET-16057	PDF with gradient fill print Issue with Chrome	Enhancement
WORDSNET-21698	Convert Word Content Controls to Editable Controls in PDF	Enhancement
WORDSNET-9273	Conversion of DOCX to DOC and PDF file format change the NUMPAGES value	Bug
WORDSNET-17043	PDF generated by Aspose.Words is blank when it is printed by Acrobat Reader DC	Bug
WORDSNET-17045	PDF with gradient fill print Issue with Chrome	Bug
WORDSNET-18805	Incorrect scaling of the secondary vertical axis after converting to PDF	Bug
WORDSNET-19157	Arrows in OfficeMath equation are rendered incorrectly in output EMF	Bug
WORDSNET-19158	OfficeMath renders incorrectly in output EMF	Bug
WORDSNET-19731	Incorrect calculation of the X-axis maximum when converting chart to PDF	Bug
WORDSNET-20917	Page numbering incorrect when Cover page present in document	Bug
WORDSNET-21782	Bottom and Right border is lost after converting Shape to Image	Bug
WORDSNET-22133	Incorrect version of harfbuzz.dll is selected when running app with .NET CLI	Bug
WORDSNET-22348	Extra text is exported after DOCX to HTML conversion	Bug
WORDSNET-22367	Latent Style Information Lost during re-Saving	Bug
WORDSNET-22457	Arabic words not rendering as expected	Bug
WORDSNET-22517	NullReferenceException when exporting to HTML	Bug
WORDSNET-22540	Data labels in the chart messed up when converting to PDF	Bug
WORDSNET-22633	HTML is loaded as plain text	Bug
WORDSNET-22683	‘Culture not supported’ exception is thrown while saving document to PDF	Bug
WORDSNET-22686	In Range.Replace how to ignore cross references	Bug
WORDSNET-22742	System.OverflowException occurs upon loading a XML	Bug
WORDSNET-22750	System.ArgumentException Illegal characters in path MHTML to HTML conversion	Bug
WORDSNET-22757	Shapes do not render correctly after DOCX to PDF conversion	Bug
WORDSNET-22759	Content misplacement issue upon converting DOCX to PDF	Bug
WORDSNET-22765	Doc to Pdf Conversion with HarfBuzz causes ArithmeticOverflowException	Bug
WORDSNET-22772	Extra vertical spacing added in PDF	Bug
WORDSNET-22777	Import of mhtml differs from what is in browser	Bug
WORDSNET-22814	Range.Replace does not replace the numbers when Number ends with line, section and page break	Bug
WORDSNET-22817	Convert the SvgReader output APS geometry into EMUs for further rendering in DML code	Bug
WORDSNET-22821	Space around the barcode is increased after conversion DOC to PDF	Bug
WORDSNET-22828	Extracting Html of a paragraph throws errors	Bug
WORDSNET-22834	Page break at bookmark position does not work	Bug
WORDSNET-22837	Loading encrypted word File Corrupted Exception	Bug
WORDSNET-22840	Missing the image which size is huge in PDF after convert word to PDF	Bug
WORDSNET-22844	Incorrect font substitution for East Asian characters in MathTextElement	Bug
WORDSNET-22847	Fix SonarQube issues	Bug

Full List of Issues Covering all Changes in this Release (Reported by Java Users)

Key	Summary	Category
WORDSNET-16733	Add feature to get the gradient fill properties of a shape	New Feature
WORDSNET-20073	Block level SDT not converting to Editable AcroForm field in PDF	New Feature
WORDSNET-22707	Add an ability to use type members of DataRow etc. classes at LINQ Reporting Engine templates	New Feature
WORDSNET-19181	X-Axis numbering in Chart missing in PDF	Bug
WORDSNET-19183	Chart’s starting ending points are not correct in PDF	Bug
WORDSNET-22044	HTML to Word DOCX visibility of Hidden Numbered and Bullet Lists	Bug
WORDSNET-22050	Allow row to break across pages is checked after DOC-HTML-DOC	Bug
WORDSNET-22573	Shape’s contents are lost after DOCX to PDF Conversion	Bug
WORDSNET-22783	Content converge and cause overlapping in PDF	Bug
WORDSNET-22829	getPageCount() returns incorrect page numbers	Bug
WORDSNET-22848	Text bold formatting is lost after DOCX to PDF	Bug

Public API and Backward Incompatible Changes

This section lists public API changes that were introduced in Aspose.Words 21.11. It includes not only new and obsoleted public methods, but also a description of any changes in the behavior behind the scenes in Aspose.Words which may affect existing code. Any behavior introduced that could be seen as regression and modifies the existing behavior is especially important and is documented here.

Added “USE_ATTACHMENTS” Mode for PDF Rendering

Related issue: WORDSNET-11453

An option has been implemented that allows to display the attachments panel in rendered PDF documents when opened in supporting readers.

The option is specified by setting PdfSaveOptions.page_mode to newly created PdfPageMode.USE_ATTACHMENTS value.

Use Case:

doc = aw.Document("input.docx")

# Specify the new page mode for PDF.
options = aw.saving.PdfSaveOptions();
options.page_mode = aw.saving.PdfPageMode.USE_ATTACHMENTS

doc.save("output.pdf", options)

Added a new ChmLoadOptions class

Related issue: WORDSNET-22513

Currently, this public class contains a single public property original_file_name. This property allows to specify the name of the CHM file. It is useful if a document is loaded from a stream and it contains links that reference the same document by the file name.

Use Case:

# The input document name.
filename = "myfile.chm"

# Create an input stream of the input document.
input_stream = open(filename, "rb")

# Specify the file name in CHM load options.
# This allows Aspose.Words to correctly resolve URLs like "ms-its:myfile.chm::/index.htm"
load_options = aw.loading.ChmLoadOptions()
load_options.original_file_name = filename

# Load the document.
doc = aw.Document(input_stream , load_options)

Added a new FindReplaceOptions.ignore_field_codes property

Related issue: WORDSNET-22686.

Use Case: Explains how to ignore text inside field codes.

# Create document.
doc = aw.Document()
builder = aw.DocumentBuilder(doc)

# Insert field.
builder.insert_field("INCLUDETEXT", "Text in field")

options = aw.replacing.FindReplaceOptions()

# Replace 'T' in document ignoring text inside field code.
options.ignore_field_codes = True
doc.range.replace_regex("T", u"*", options)
print(doc.get_text())

# Replace 'T' in document NOT ignoring text inside field code.
options.ignore_field_codes = False
doc.range.replace_regex("T", u"*", options)
print(doc.get_text())

# The code produces the following output:
# \u0013INCLUDETEXT\u0014*ext in field\u0015\u000c
# \u0013INCLUDE*EX*\u0014*ext in field\u0015\u000c

Added new export_images_as_base64 public option

Related issue: WORDSNET-22787

A new export_images_as_base64 public option has been added in MarkdownSaveOptions class.

Use case: Explains how to save Base64 encoded images directly inside a Markdown document.

doc = aw.Document("DocumentWithShape.docx")

mso = aw.saving.MarkdownSaveOptions()
mso.export_images_as_base64 = True

doc.save("DocumentWithShape.md", mso)

New style identifiers for Mention, Smart Hyperlink, Hashtag and Unresolved Mention have been added

Related issue: WORDSNET-22367

New style identifiers for Mention, Smart Hyperlink, Hashtag and Unresolved Mention have been added.

class StyleIdentifier(IntEnum):
    ...

    # The Mention style.
    MENTION = 372

    # The SmartHyperlink style.
    SMART_HYPERLINK = 373

    # The Hashtag style.
    HASHTAG = 374

    # The UnresolvedMention style.
    UNRESOLVED_MENTION = 375

The styles are undocumented, they do not have their own constants in Micosoft VBA and they are not displayed in MS Word Style Sheet. The styles are added to preserve the document structure invariability, in particular to preserve latent styles when converting Docx to Docx using Aspose.Words.