/    Sign up×
Community /Pin to ProfileBookmark

Convert scanned .jpg/.gif files into HTML…?

Hello JPnyc & Everyone

I’m new to this forum.

Been grappling with a problem and hope someone can give me a hand with it…

I’m trying to convert a book into HTML — Unfortunately, the original manuscript pages are in individually scanned .jpg/.gif format, and these files are all I have to work with. These image files contain not only text, but also diagrams/ drawings/ images/ tables/ mathematical equations/ etc. (I have attached some images for you to have a look)

[B]How do I convert these .jpg/.gif files into HTML pages?[/B]

What are the steps to do this?

Would be grateful for some step-by-step pointers, including recommendations of software (preferably free/shareware) to do this conversion.

I’m proficient in HTML, but the conversion from image to HTML stuff is new to me.

Thanks, guys!

Shoi

[upl-file uuid=2782d814-39b3-4e8d-a981-79e889feff69 size=63kB]image4.jpg[/upl-file]

[upl-file uuid=2fcebc43-faec-4fa4-ac20-eb06f9108ef7 size=16kB]IMAGE12.gif[/upl-file]

[upl-file uuid=1eae9007-05de-467a-98cc-09fd2cc85f85 size=62kB]image23-1.jpg[/upl-file]

to post a comment
HTML

6 Comments(s)

Copy linkTweet thisAlerts:
@TheBearMayOct 06.2006 — Don't know of any way to do what you're asking except use them as images inside of an html page. Certain pages, like the last image, may lend themselves to conversion to TIFF files which then would be eligible for an attempt at OCR (optical character recognition) which might be able to give you usable text.
Copy linkTweet thisAlerts:
@ShoiauthorOct 06.2006 — Thanks, TheBearMay... Appreciate the advice.

I've tried running the jpg & gif files through SimpleOCR

[URL=http://www.simpleocr.com/]http://www.simpleocr.com/[/URL]

Tried to convert to text but the results came back as gibberish!

Shoi
Copy linkTweet thisAlerts:
@TheBearMayOct 06.2006 — With OCR I've had the best luck with converting the files to black and white TIFFs, gray scales and color don't seem to work as well.
Copy linkTweet thisAlerts:
@ray326Oct 06.2006 — The third image has OCR potential. The other two would be a lot easier to do manually, cutting out the sub-images for reuse as needed. I've found with OCR you often get what you pay for, too. There really is a difference in the different OCR products.
Copy linkTweet thisAlerts:
@ShoiauthorOct 07.2006 — Thanks TheBearMay & ray326!

Appreciate your advice.

Can you recommend a reliable OCR software? User-friendly would be helpful.

And one that can at least handle the text conversion portion of the scanned images (while I slice & dice the diagrams/ drawings/ images/ tables/ etc and insert them manually into the html pages).
Copy linkTweet thisAlerts:
@ray326Oct 09.2006 — I use a thing called ABBYY FineReader Sprint on Windows. It's limited but does a pretty good job on a good scan.
×

Success!

Help @Shoi spread the word by sharing this article on Twitter...

Tweet This
Sign in
Forgot password?
Sign in with TwitchSign in with GithubCreate Account
about: ({
version: 0.1.9 BETA 5.20,
whats_new: community page,
up_next: more Davinci•003 tasks,
coming_soon: events calendar,
social: @webDeveloperHQ
});

legal: ({
terms: of use,
privacy: policy
});
changelog: (
version: 0.1.9,
notes: added community page

version: 0.1.8,
notes: added Davinci•003

version: 0.1.7,
notes: upvote answers to bounties

version: 0.1.6,
notes: article editor refresh
)...
recent_tips: (
tipper: @AriseFacilitySolutions09,
tipped: article
amount: 1000 SATS,

tipper: @Yussuf4331,
tipped: article
amount: 1000 SATS,

tipper: @darkwebsites540,
tipped: article
amount: 10 SATS,
)...