Conversion to grayscale
Contents
[
Hide
]
In most cases, color is not needed for recognition and can even mislead OCR algorithms. Grayscale allows images to be processed more efficiently, resulting in less specks, cleaner backgrounds, and crisper text than color images. Converting to grayscale can also improve the results of other preprocessing filters, such as automatic deskewing.
Aspose.OCR provides a method for converting an image to grayscale before proceeding with preprocessing or OCR.
Grayscale conversion is automatically performed when applying the median filter.
Grayscale conversion
To convert the image to grayscale, run the image through ToGrayscale
preprocessing filter.
AsposeOCR api = new AsposeOCR();
// Scale the image to twice its original size using bilinear interpolation
PreprocessingFilter filters = new PreprocessingFilter();
filters.add(PreprocessingFilter.ToGrayscale());
// Save preprocessed image to file
BufferedImage imageRes = api.PreprocessImage("source.png", filters);
File outputSource = new File("result.png");
ImageIO.write(imageRes, "png", outputSource);
// Append preprocessing filters to recognition settings
RecognitionSettings recognitionSettings = new RecognitionSettings();
recognitionSettings.setPreprocessingFilters(filters);
// Recognize image
RecognitionResult result = api.RecognizePage("source.png", recognitionSettings);
System.out.println("Recognition result:\n" + result.recognitionText + "\n\n");
Usage scenarios
Grayscale conversion is recommended for the following images:
- Photos.
- Scanned ID cards and other personal documents.
- Full-color scans.
Improvements in recognition accuracy and preprocessing enhancements will be highly dependent on the original image and should be empirically tested.