Windows Vista Tips

Windows Vista Tips > Newsgroups > Windows Vista General Discussion > OCR-ed text in TIF files is not indexed?

Reply
Thread Tools Display Modes

OCR-ed text in TIF files is not indexed?

 
 
Jerry
Guest
Posts: n/a

 
      04-07-2007
I have a large document archive consisting of scanned files in TIF format,
where all text has been OCR-ed. With Windows XP I was using the Windows
Desktop Search ver. 2.6 that was neatly handling the OCR-ed text.

After updating to Vista, that is no longer the case, in spite of my having
selected "index properties and file contents" option for TIF files. Is there
a way I can get this to work?

 
Reply With Quote
 
 
 
 
kirk jim
Guest
Posts: n/a

 
      04-07-2007
wait one moment...

if you have OCR'ed the TIF images.. then you have produced text files with
the writting from those images in some text format like txt, doc or rtf.

Those text files can be indexed.

If you want vista to READ the TIF IMAGES and understand the writting on
them, you are out of luck! lol thats not possible



"Jerry" <> wrote in message
news:...
>I have a large document archive consisting of scanned files in TIF format,
>where all text has been OCR-ed. With Windows XP I was using the Windows
>Desktop Search ver. 2.6 that was neatly handling the OCR-ed text.
>
> After updating to Vista, that is no longer the case, in spite of my having
> selected "index properties and file contents" option for TIF files. Is
> there a way I can get this to work?



 
Reply With Quote
 
Andre Da Costa[ActiveWin]
Guest
Posts: n/a

 
      04-07-2007
The text in the image has to be converted to actual text to be indexed.
--
Andre
Blog: http://adacosta.spaces.live.com
My Vista Quickstart Guide:
http://adacosta.spaces.live.com/blog...3DB!9709.entry
"Jerry" <> wrote in message
news:...
>I have a large document archive consisting of scanned files in TIF format,
>where all text has been OCR-ed. With Windows XP I was using the Windows
>Desktop Search ver. 2.6 that was neatly handling the OCR-ed text.
>
> After updating to Vista, that is no longer the case, in spite of my having
> selected "index properties and file contents" option for TIF files. Is
> there a way I can get this to work?



 
Reply With Quote
 
Jerry
Guest
Posts: n/a

 
      04-07-2007
The text is stored in the same TIF file. Done using the Document Imaging
program (part of MS Office).
WDS 2.6 had no problem indexing it.


"kirk jim" <11@11.11> wrote in message
news:OU%...
> wait one moment...
>
> if you have OCR'ed the TIF images.. then you have produced text files with
> the writting from those images in some text format like txt, doc or rtf.
>
> Those text files can be indexed.
>
> If you want vista to READ the TIF IMAGES and understand the writting on
> them, you are out of luck! lol thats not possible
>
>
>
> "Jerry" <> wrote in message
> news:...
>>I have a large document archive consisting of scanned files in TIF format,
>>where all text has been OCR-ed. With Windows XP I was using the Windows
>>Desktop Search ver. 2.6 that was neatly handling the OCR-ed text.
>>
>> After updating to Vista, that is no longer the case, in spite of my
>> having selected "index properties and file contents" option for TIF
>> files. Is there a way I can get this to work?

>
>


 
Reply With Quote
 
kirk jim
Guest
Posts: n/a

 
      04-07-2007
Jerry,

a tiff file is normally only an image file...

however I know that you can add a layer of text on it and annotations,
because you can do that on XP with the document fax and imaging viewer.

However if you view that tiff image with another viewer like irfanview the
annotations are not viewable....

So I dont know what to advise you... perhaps MS thought it was no longer a
standard , and removed that capability from the indexing ?



"Jerry" <> wrote in message
news:B5C0A027-0708-4B30-889F-...
> The text is stored in the same TIF file. Done using the Document Imaging
> program (part of MS Office).
> WDS 2.6 had no problem indexing it.
>
>
> "kirk jim" <11@11.11> wrote in message
> news:OU%...
>> wait one moment...
>>
>> if you have OCR'ed the TIF images.. then you have produced text files
>> with the writting from those images in some text format like txt, doc or
>> rtf.
>>
>> Those text files can be indexed.
>>
>> If you want vista to READ the TIF IMAGES and understand the writting on
>> them, you are out of luck! lol thats not possible
>>
>>
>>
>> "Jerry" <> wrote in message
>> news:...
>>>I have a large document archive consisting of scanned files in TIF
>>>format, where all text has been OCR-ed. With Windows XP I was using the
>>>Windows Desktop Search ver. 2.6 that was neatly handling the OCR-ed text.
>>>
>>> After updating to Vista, that is no longer the case, in spite of my
>>> having selected "index properties and file contents" option for TIF
>>> files. Is there a way I can get this to work?

>>
>>

>



 
Reply With Quote
 
Guest
Posts: n/a

 
      04-07-2007
It used an Office Filter, install Office 2003 or 2007.,
"kirk jim" <11@11.11> wrote in message
news:...
> Jerry,
>
> a tiff file is normally only an image file...
>
> however I know that you can add a layer of text on it and annotations,
> because you can do that on XP with the document fax and imaging viewer.
>
> However if you view that tiff image with another viewer like irfanview the
> annotations are not viewable....
>
> So I dont know what to advise you... perhaps MS thought it was no longer a
> standard , and removed that capability from the indexing ?
>
>
>
> "Jerry" <> wrote in message
> news:B5C0A027-0708-4B30-889F-...
>> The text is stored in the same TIF file. Done using the Document Imaging
>> program (part of MS Office).
>> WDS 2.6 had no problem indexing it.
>>
>>
>> "kirk jim" <11@11.11> wrote in message
>> news:OU%...
>>> wait one moment...
>>>
>>> if you have OCR'ed the TIF images.. then you have produced text files
>>> with the writting from those images in some text format like txt, doc or
>>> rtf.
>>>
>>> Those text files can be indexed.
>>>
>>> If you want vista to READ the TIF IMAGES and understand the writting on
>>> them, you are out of luck! lol thats not possible
>>>
>>>
>>>
>>> "Jerry" <> wrote in message
>>> news:...
>>>>I have a large document archive consisting of scanned files in TIF
>>>>format, where all text has been OCR-ed. With Windows XP I was using the
>>>>Windows Desktop Search ver. 2.6 that was neatly handling the OCR-ed
>>>>text.
>>>>
>>>> After updating to Vista, that is no longer the case, in spite of my
>>>> having selected "index properties and file contents" option for TIF
>>>> files. Is there a way I can get this to work?
>>>
>>>

>>

>
>


 
Reply With Quote
 
Jerry
Guest
Posts: n/a

 
      04-07-2007
Text in the TIF format *is* part of the standard. Incidentally, the spec
fathers were Microsoft and Aldus (now part of Adobe).
MS Office comes with MODI iFilters for TIF and MDI formats, precisely in
order to allow indexing of everything, metadata/text included.
I suspect something must be wrong with some obscure setting buried deep in
the Registry :-)


"kirk jim" <11@11.11> wrote in message
news:...
> Jerry,
>
> a tiff file is normally only an image file...
>
> however I know that you can add a layer of text on it and annotations,
> because you can do that on XP with the document fax and imaging viewer.
>
> However if you view that tiff image with another viewer like irfanview the
> annotations are not viewable....
>
> So I dont know what to advise you... perhaps MS thought it was no longer a
> standard , and removed that capability from the indexing ?
>
>
>
> "Jerry" <> wrote in message
> news:B5C0A027-0708-4B30-889F-...
>> The text is stored in the same TIF file. Done using the Document Imaging
>> program (part of MS Office).
>> WDS 2.6 had no problem indexing it.
>>
>>
>> "kirk jim" <11@11.11> wrote in message
>> news:OU%...
>>> wait one moment...
>>>
>>> if you have OCR'ed the TIF images.. then you have produced text files
>>> with the writting from those images in some text format like txt, doc or
>>> rtf.
>>>
>>> Those text files can be indexed.
>>>
>>> If you want vista to READ the TIF IMAGES and understand the writting on
>>> them, you are out of luck! lol thats not possible
>>>
>>>
>>>
>>> "Jerry" <> wrote in message
>>> news:...
>>>>I have a large document archive consisting of scanned files in TIF
>>>>format, where all text has been OCR-ed. With Windows XP I was using the
>>>>Windows Desktop Search ver. 2.6 that was neatly handling the OCR-ed
>>>>text.
>>>>
>>>> After updating to Vista, that is no longer the case, in spite of my
>>>> having selected "index properties and file contents" option for TIF
>>>> files. Is there a way I can get this to work?
>>>
>>>

>>

>
>


 
Reply With Quote
 
kirk jim
Guest
Posts: n/a

 
      04-07-2007
do you have office installed now?



"Jerry" <> wrote in message
news:279E56B6-AE84-43E9-AD7D-...
> Text in the TIF format *is* part of the standard. Incidentally, the spec
> fathers were Microsoft and Aldus (now part of Adobe).
> MS Office comes with MODI iFilters for TIF and MDI formats, precisely in
> order to allow indexing of everything, metadata/text included.
> I suspect something must be wrong with some obscure setting buried deep in
> the Registry :-)
>
>
> "kirk jim" <11@11.11> wrote in message
> news:...
>> Jerry,
>>
>> a tiff file is normally only an image file...
>>
>> however I know that you can add a layer of text on it and annotations,
>> because you can do that on XP with the document fax and imaging viewer.
>>
>> However if you view that tiff image with another viewer like irfanview
>> the annotations are not viewable....
>>
>> So I dont know what to advise you... perhaps MS thought it was no longer
>> a
>> standard , and removed that capability from the indexing ?
>>
>>
>>
>> "Jerry" <> wrote in message
>> news:B5C0A027-0708-4B30-889F-...
>>> The text is stored in the same TIF file. Done using the Document Imaging
>>> program (part of MS Office).
>>> WDS 2.6 had no problem indexing it.
>>>
>>>
>>> "kirk jim" <11@11.11> wrote in message
>>> news:OU%...
>>>> wait one moment...
>>>>
>>>> if you have OCR'ed the TIF images.. then you have produced text files
>>>> with the writting from those images in some text format like txt, doc
>>>> or rtf.
>>>>
>>>> Those text files can be indexed.
>>>>
>>>> If you want vista to READ the TIF IMAGES and understand the writting on
>>>> them, you are out of luck! lol thats not possible
>>>>
>>>>
>>>>
>>>> "Jerry" <> wrote in message
>>>> news:...
>>>>>I have a large document archive consisting of scanned files in TIF
>>>>>format, where all text has been OCR-ed. With Windows XP I was using the
>>>>>Windows Desktop Search ver. 2.6 that was neatly handling the OCR-ed
>>>>>text.
>>>>>
>>>>> After updating to Vista, that is no longer the case, in spite of my
>>>>> having selected "index properties and file contents" option for TIF
>>>>> files. Is there a way I can get this to work?
>>>>
>>>>
>>>

>>
>>

>



 
Reply With Quote
 
kirk jim
Guest
Posts: n/a

 
      04-07-2007
> fathers were Microsoft and Aldus (now part of Adobe).

I read about the tiff file specs on wikipedia after you asked.... so I know
the story now


"Jerry" <> wrote in message
news:279E56B6-AE84-43E9-AD7D-...
> Text in the TIF format *is* part of the standard. Incidentally, the spec
> fathers were Microsoft and Aldus (now part of Adobe).
> MS Office comes with MODI iFilters for TIF and MDI formats, precisely in
> order to allow indexing of everything, metadata/text included.
> I suspect something must be wrong with some obscure setting buried deep in
> the Registry :-)
>
>
> "kirk jim" <11@11.11> wrote in message
> news:...
>> Jerry,
>>
>> a tiff file is normally only an image file...
>>
>> however I know that you can add a layer of text on it and annotations,
>> because you can do that on XP with the document fax and imaging viewer.
>>
>> However if you view that tiff image with another viewer like irfanview
>> the annotations are not viewable....
>>
>> So I dont know what to advise you... perhaps MS thought it was no longer
>> a
>> standard , and removed that capability from the indexing ?
>>
>>
>>
>> "Jerry" <> wrote in message
>> news:B5C0A027-0708-4B30-889F-...
>>> The text is stored in the same TIF file. Done using the Document Imaging
>>> program (part of MS Office).
>>> WDS 2.6 had no problem indexing it.
>>>
>>>
>>> "kirk jim" <11@11.11> wrote in message
>>> news:OU%...
>>>> wait one moment...
>>>>
>>>> if you have OCR'ed the TIF images.. then you have produced text files
>>>> with the writting from those images in some text format like txt, doc
>>>> or rtf.
>>>>
>>>> Those text files can be indexed.
>>>>
>>>> If you want vista to READ the TIF IMAGES and understand the writting on
>>>> them, you are out of luck! lol thats not possible
>>>>
>>>>
>>>>
>>>> "Jerry" <> wrote in message
>>>> news:...
>>>>>I have a large document archive consisting of scanned files in TIF
>>>>>format, where all text has been OCR-ed. With Windows XP I was using the
>>>>>Windows Desktop Search ver. 2.6 that was neatly handling the OCR-ed
>>>>>text.
>>>>>
>>>>> After updating to Vista, that is no longer the case, in spite of my
>>>>> having selected "index properties and file contents" option for TIF
>>>>> files. Is there a way I can get this to work?
>>>>
>>>>
>>>

>>
>>

>



 
Reply With Quote
 
Frank
Guest
Posts: n/a

 
      04-07-2007
kirk jim wrote:


I read about the tiff file specs on wikipedia after you asked.... so I
know the story now

You read about something on wikipedia?
****...that explains everything.
Never mind.
Frank
 
Reply With Quote
 
 
 
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Offline Files not indexed? Richard Perry Windows Vista General Discussion 16 03-16-2008 02:45 AM
Why is text not indexed? Jerry Windows Vista General Discussion 2 02-20-2007 03:46 AM
TIF files not indexed Jerry Windows Vista General Discussion 0 02-19-2007 06:57 AM
TIF files not indexed Jerry Windows Vista General Discussion 0 02-18-2007 06:52 PM
Searching within non-indexed files. auser Windows Vista General Discussion 1 12-21-2006 04:16 PM



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59