OK, so this is another odd thing about this... search for "Comey" (with an 'm') and you get 1 occurrence. Search for "Corney" ('r') and you do get about 50 hits...
BUT!! ...in each case where it finds (Ctrl+F) an occurrence of "Corney" it is spelled "Comey" on the page.
Perhaps it has something to do with "kerning"... manipulating the space between characters to force the 'r' and 'n' together to appear as something else. Now that I think about it, you can do the same thing in Word... take any misspelled word, set the kerning to 0 in the desired letters, and then copy-pasta (or find-and-replace) that word throughout the document.
So it's established that it CAN be done, but the question now is "Why?"
damn, this is correct. I just copy pasta'd directly from the PDF doc into a text editor and even though it visually looks like "Comey" the characters are "Corney"
I think it's scanned and there's an OCR pass done on it to provide the ability to search, etc, and that OCR has misstaken a "m" for an "rn". Try searching for "inforrnation" or "ernail", you get a few hits for those as well.
OK, so this is another odd thing about this... search for "Comey" (with an 'm') and you get 1 occurrence. Search for "Corney" ('r') and you do get about 50 hits...
BUT!! ...in each case where it finds (Ctrl+F) an occurrence of "Corney" it is spelled "Comey" on the page.
How does a PDF document do that?
it's the font tricking you. it's not that hard to customize a font to make r + n look like m
Perhaps it has something to do with "kerning"... manipulating the space between characters to force the 'r' and 'n' together to appear as something else. Now that I think about it, you can do the same thing in Word... take any misspelled word, set the kerning to 0 in the desired letters, and then copy-pasta (or find-and-replace) that word throughout the document.
So it's established that it CAN be done, but the question now is "Why?"
damn, this is correct. I just copy pasta'd directly from the PDF doc into a text editor and even though it visually looks like "Comey" the characters are "Corney"
This is interesting.
Why the f would they do that?
Are they comms?
Are there more comms in this doccument?
Zero kerning.
When OCR happens on a printed document, the computer will embed what it "thinks" the characters are. So it's metadata behind the scenes.
Also people can F around with that metadata and make "corrections" to the text if necessary.
I think it's scanned and there's an OCR pass done on it to provide the ability to search, etc, and that OCR has misstaken a "m" for an "rn". Try searching for "inforrnation" or "ernail", you get a few hits for those as well.