Dilation
Some images, such as store receipts, have very thin characters that may be damaged by automatic contrast corrections or binarization.
Aspose.OCR provides a special preprocessing filter called dilation that can increase the thickness of characters in an image by adding pixels to the edges of high-contrast objects, such as letters.
Dilation
To increase the thickness of characters in an image, run the image through BinarizeAndDilate
preprocessing filter.
AsposeOCR api = new AsposeOCR();
// Scale the image to twice its original size using bilinear interpolation
PreprocessingFilter filters = new PreprocessingFilter();
filters.add(PreprocessingFilter.BinarizeAndDilate());
// Save preprocessed image to file
BufferedImage imageRes = api.PreprocessImage("source.png", filters);
File outputSource = new File("result.png");
ImageIO.write(imageRes, "png", outputSource);
// Append preprocessing filters to recognition settings
RecognitionSettings recognitionSettings = new RecognitionSettings();
recognitionSettings.setPreprocessingFilters(filters);
// Recognize image
RecognitionResult result = api.RecognizePage("source.png", recognitionSettings);
System.out.println("Recognition result:\n" + result.recognitionText + "\n\n");
Image regions preprocessing
BinarizeAndDilate
filter can be applied to specific regions of an image. For example, you can make the article body text thicker while leaving the headers unchanged.
To apply a filter to an area, specify its top left corner along with width and height as Rectangle
object. If the region is omitted, the filter is applied to the entire image.
Rectangle rectangle = new Rectangle(5, 161, 340, 340);
PreprocessingFilter filters = new PreprocessingFilter();
filters.add(PreprocessingFilter.BinarizeAndDilate(rectangle));
Usage scenarios
Dilation is recommended for the following images:
- Receipts.
- Printouts with very thin font.