OCR processing millions of images on Amazon’s EC2

Note: I wrote this post nearly 2 years ago and recently discovered it in my drafts, some of the info is outdated. Recently, I was tasked with running OCR on a huge set of images (3.4 million.) I’m going to post some brief details on how we processed these images in about a week. Initially,… Continue reading OCR processing millions of images on Amazon’s EC2