JFK RESEARCH TOOLS: I've created a simple process for downloading all the JFK documents and using Windows as a database to search them. - The Great Awakening


posted 6 days ago by Mister_Winston +264 / -0

It's 2:30 AM, so I may forget some details in these steps. I've spent all night researching and testing free tools that could accomplish this objective. Feel free to substitute whatever tools you prefer.

The instructions below are for Windows and Edge. You can use Chrome too.

These are the steps for downloading all the JFK documents. This is the easy part.

Create two Windows folders to store the PDFs. Name the first one "JFK Files." Within that folder, create another named "OCR."
Install the Chrono Download Manager extension for either Edge or Chrome:

Edge: https://microsoftedge.microsoft.com/addons/detail/chrono-download-manager/klnjihiihbjoehcifggognefbkkfpfhj

Chrome: https://chromewebstore.google.com/detail/chrono-download-manager/mciiogijehkdemklbdcbfkefimifhecn

After the extension is installed, change your browser's download location to the "JFK Files" folder you just created. Chrono Download Manager conveniently pops up a window with a link to change this, along with one other setting.
Open the browser and pin the Chrono Sniffer to the toolbar. To do this in Edge, first click the Extensions button to the right of the URL bar. You'll see the Chrono Download Manager extension with three little dots next to it. Click the dots, then click "Pin to toolbar." You'll then see the Chrono icon next to the URL bar. This is the Chrono Sniffer.
Go to the National Archives page for the JFK files. Where the dropdown says "Show 10 entries," change the dropdown to "Show All Entries." You want all 1,123 PDF links to be visible. Don't worry, the Archives website is FAST.

https://www.archives.gov/research/jfk/release-2025

Click the Chrono Sniffer Icon. Click the Document tab. You will see a lot of links listed in the sniffer, so below that list, click the PDF filter. Only the PDF links will be highlighted and have a check mark, meaning only those files will be downloaded. Click the Download All button. This will start the download of all 1,123 files. It took me less than 10 minutes to complete all the downloads over a fiber connection.

You can view the download progress in the Chrono dashboard if you want, but you can also view it in your JFK Files folder.
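
For anyone who would rather script the download than use a browser extension, here is a minimal Python sketch. It is not the Chrono method described above. It assumes the Archives page lists each file as an ordinary link ending in .pdf and that the server accepts plain scripted requests; neither is confirmed here. The names PdfLinkParser, extract_pdf_links, and download_all are made up for illustration.

```python
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen

PAGE_URL = "https://www.archives.gov/research/jfk/release-2025"


class PdfLinkParser(HTMLParser):
    """Collects the href of every <a> tag that points at a .pdf file."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Assumption: PDF links are plain hrefs whose path ends in .pdf.
        if href and urlparse(href).path.lower().endswith(".pdf"):
            absolute = urljoin(self.base_url, href)
            if absolute not in self.links:
                self.links.append(absolute)


def extract_pdf_links(html, base_url=PAGE_URL):
    """Return the unique, absolute PDF URLs found in html, in page order."""
    parser = PdfLinkParser(base_url)
    parser.feed(html)
    return parser.links


def download_all(dest="JFK Files", page_url=PAGE_URL):
    """Fetch the page, then save every linked PDF into dest (skips existing)."""
    dest_dir = Path(dest)
    dest_dir.mkdir(parents=True, exist_ok=True)
    with urlopen(page_url) as response:
        html = response.read().decode("utf-8", errors="replace")
    for url in extract_pdf_links(html, page_url):
        target = dest_dir / Path(urlparse(url).path).name
        if target.exists():
            continue  # already downloaded on an earlier run
        with urlopen(url) as response:
            target.write_bytes(response.read())
```

Calling download_all() would save the files into a "JFK Files" folder under the current directory. Because files that already exist are skipped, an interrupted run can simply be started again.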

After all the PDFs are downloaded, you'll want to enhance the files with OCR (Optical Character Recognition) so they are searchable. If you don't have Adobe Acrobat, there's a free tool that can do this in bulk.

Go to the PDF24 Creator site and download the latest Windows version, 11.23.0. Note that this free application offers capabilities that other developers charge a lot of money for, like batch processing of PDFs.

https://tools.pdf24.org/en/creator

When you first launch the tool, you will need to create an account with your email and a password (Booooo!), then register the tool using the code you receive by email.
After registration, when you see the large menu of PDF options, click PDF OCR.
You'll see a page with many lines, like a ledger. On the right, change the Output directory to the "OCR" folder you created within the "JFK Files" folder.
On the left, you can now click the Add Files button. Add as many PDFs as you want. For now, I'm doing 10 at a time, and tomorrow I'll test 50 at a time. I'm not sure if the software has a limit. Converting all 1,123 files should take a few hours.
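
If you feed PDF24 in batches, it helps to know which files still need OCR. Here is a rough Python sketch for that. It assumes the OCR output keeps the same file name as the input, which may not match how your copy of PDF24 is configured. The helpers pending_pdfs and batches are hypothetical names.

```python
from pathlib import Path


def pending_pdfs(source_dir, ocr_dir):
    """Return source PDFs with no same-named file in the OCR folder, sorted."""
    # Assumption: the OCR tool writes its output under the original file name.
    done = {p.name.lower() for p in Path(ocr_dir).glob("*.pdf")}
    return sorted(
        p for p in Path(source_dir).glob("*.pdf") if p.name.lower() not in done
    )


def batches(items, size):
    """Yield consecutive slices of items, each holding at most size entries."""
    for start in range(0, len(items), size):
        yield items[start:start + size]
```

For example, `for group in batches(pending_pdfs("JFK Files", "JFK Files/OCR"), 10)` walks through the remaining files ten at a time, matching the batch size mentioned above.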

Here's the beautiful part: once you've enhanced all the PDFs with OCR, you can search all of them through Windows Explorer. To do this, click into the OCR folder and enter your search term in the Windows Explorer search field. For example, if I enter "Oswald" in the search field, Windows will list every PDF that contains that word along with some preview text. So your OCR folder is now a database of declassified JFK files.
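
If the Explorer search is not an option, the same kind of lookup can be sketched in Python. This is an illustration, not what Windows does internally. It assumes the third-party pypdf package is installed and that the OCR text layer is readable by it. The function names pdf_text and search_folder are hypothetical.

```python
from pathlib import Path


def pdf_text(path):
    """Extract text with the third-party pypdf package (pip install pypdf)."""
    from pypdf import PdfReader  # imported lazily so the rest works without it

    reader = PdfReader(str(path))
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def search_folder(folder, term, extract=pdf_text):
    """Return (file name, snippet) for every PDF whose text contains term."""
    needle = term.lower()
    hits = []
    for path in sorted(Path(folder).glob("*.pdf")):
        text = extract(path)
        pos = text.lower().find(needle)  # case-insensitive match
        if pos != -1:
            start = max(0, pos - 40)
            # Collapse whitespace so the preview fits on one line.
            snippet = " ".join(text[start:pos + len(term) + 40].split())
            hits.append((path.name, snippet))
    return hits
```

Something like `search_folder("JFK Files/OCR", "Oswald")` would list each matching file with a short preview. It re-reads every PDF on each search, so it will be much slower than an indexed search across 1,123 files.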

Comments (36)

– bubble_bursts 17 points 6 days ago +17 / -0

Alternative for Mac and Linux users:

Here is a script to download all the files using curl. It should work on Linux and Mac as long as curl is installed.

https://pastebin.com/raw/jtNkkNWz


– Space_Monkey 7 points 6 days ago +7 / -0

Thank you!

Instructions for those who don't know how to use this:

Create a new folder
Inside the folder create a new file and name it "download.sh"
Paste the script into the file and save it.
Make it executable, either by right-clicking, going to Properties, and clicking executable under permissions, or by typing into the terminal:

chmod +x download.sh

then type:

./download.sh

And it will download everything.
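
For reference, the chmod step can also be done from Python, which may help if the file manager hides the permissions option. This is only a sketch: make_executable is a made-up helper, and it sets the execute bit for user, group, and others without applying the umask, so it is close to but not identical to `chmod +x`.

```python
import os
import stat


def make_executable(path):
    """Add the execute bits to path, roughly what `chmod +x` does."""
    mode = os.stat(path).st_mode
    os.chmod(path, mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
```

After `make_executable("download.sh")`, the script can be started with `./download.sh` as described above.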


– bubble_bursts 2 points 6 days ago +2 / -0

Thanks fren!


– BGridd 5 points 6 days ago +5 / -0

For information, curl usually comes installed with Windows 10/11 as well.


– bubble_bursts 1 point 6 days ago +1 / -0

Yes, it should work if you save it as a batch file, but I wasn't sure whether curl on Windows works exactly the same as on Linux.


– dec3169 3 points 6 days ago +3 / -0

I used the powershell script Cats5 posted earlier and changed the directory to a linux path. Then I did "sudo snap install powershell --classic" and ran the ps script inside of linux. Powershell is crap and convoluted, but in linux I just did "powershell scriptname.ps1" and it worked. I was surprised.


– bubble_bursts 1 point 6 days ago +1 / -0

I actually like PS on Windows. Even though it's slow, it's very powerful. You can do anything that you can do via the GUI, which makes it very good for scripting. Never knew there was a PowerShell on Linux!


– Trump1234KAG 3 points 6 days ago +3 / -0

Bless you for this script.

TBH, its creation must have been a lot of tedium...


– bubble_bursts 2 points 6 days ago +2 / -0

Not really. I used a macro in Emacs. I have a motto for coding: "If it can't be done with Emacs macros, you better be getting paid for doing it!"


– Trump1234KAG 1 point 5 days ago +1 / -0

As long as you're parsing it out of the HTML, I agree that it wouldn't be too bad. It's just the act of stripping away all the crud to get just the file names, then "for line in file, append the curl phrase." Just tedium...
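
That idea can be sketched in a few lines of Python. To be clear, this is not bubble_bursts's actual script, which was made with an Emacs macro. The sketch assumes every file sits under the 2025/0318 directory seen in the file URLs posted in this thread and that the page HTML uses double-quoted href attributes. The names pdf_names and build_script are hypothetical.

```python
import re

# Assumption: all release files live under this one directory.
BASE_URL = "https://www.archives.gov/files/research/jfk/releases/2025/0318/"


def pdf_names(html):
    """Pull unique .pdf file names out of href attributes, in page order."""
    names = []
    for href in re.findall(r'href="([^"]+\.pdf)"', html, flags=re.IGNORECASE):
        name = href.rsplit("/", 1)[-1]  # strip the path, keep the file name
        if name not in names:
            names.append(name)
    return names


def build_script(names, base_url=BASE_URL):
    """Return the text of a shell script with one `curl -O` line per file."""
    lines = ["#!/bin/sh"]
    lines += [f"curl -O '{base_url}{name}'" for name in names]
    return "\n".join(lines) + "\n"
```

Writing the result of `build_script(pdf_names(page_html))` to download.sh gives a script in the same spirit: one curl command per file, each saved under its original name.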


– Deaf_MAGA_Pede 3 points 6 days ago +3 / -0

Thanks for this. I spent like 4 hours trying to get wget to work with different options and searching online for which options to use because I kept getting a 404 on accessing the ../2025/0318/ subdirectory for some reason.

I didn't think to use curl cuz I thought if wget couldn't work, curl probably won't work.


– Trumpis1sexyman 5 points 6 days ago +6 / -1

Thanks loads. Gonna have hubby do it. I have to vacuum my downstairs tomorrow!


– Mister_Winston [S] 6 points 6 days ago +6 / -0

You have the best name on GAW since "Libtards R Stoopid (really stoopid)".


– Trumpis1sexyman 5 points 6 days ago +5 / -0

Thank you. I truly believe my name. Was very easy to pick! My hubby got his feelings hurt for about 20 seconds!


– bubble_bursts 4 points 6 days ago +4 / -0

Your hubby is the real hero in all this!


– dec3169 1 point 6 days ago +1 / -0

Get a robot vacuum and save yourself some free time.


– Lawjic 4 points 6 days ago +4 / -0

OCR will not work on a significant portion of the files. I took a random sample of a few dozen files and the text is often blurry, extremely faint, covered in handwritten scribbles, or simply handwritten.
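
A spot check like that can be partly automated once the OCR pass is done. The Python sketch below flags pages where OCR recovered very little text. It takes each page's text as a plain string, which you would get from whatever PDF library you use. The 200-character threshold is an arbitrary guess, and sparse_pages and sample_files are hypothetical names.

```python
import random


def sparse_pages(page_texts, min_chars=200):
    """Return 1-based page numbers whose extracted text has few letters/digits."""
    # min_chars is an arbitrary cutoff; tune it against pages you have eyeballed.
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if sum(ch.isalnum() for ch in text) < min_chars
    ]


def sample_files(paths, k, seed=0):
    """Pick up to k items at random; a fixed seed makes the sample repeatable."""
    items = list(paths)
    return random.Random(seed).sample(items, min(k, len(items)))
```

Pages that come back from sparse_pages are candidates for reading by hand. A low count does not prove the OCR failed, since some pages are nearly blank to begin with.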


– Mister_Winston [S] 6 points 6 days ago +6 / -0

I noticed that too. I intend to use Photoshop filters on some of the blurry ones.


– p8riot 4 points 6 days ago +4 / -0

There's a PDF with almost 700 pages that I believe is of great significance and is meant for us to be the detectives. Most of it is newspaper clippings and handwritten notes, so while your method is useful it will miss so much of significance. Much of this will have to be done by hand.

If only there were some way to crowdsource this, so we could optimize the time spent, reduce redundancy, and pool the work into an online database. I'm not technologically savvy enough to organize it.

This is the document I mention at the beginning: https://www.archives.gov/files/research/jfk/releases/2025/0318/157-10014-10242.pdf


– Godisglory1 2 points 6 days ago +2 / -0

Thank you. It is easier this way.


– Mister_Winston [S] 2 points 6 days ago +2 / -0

Yep, this is why old fashioned human intelligence will never be obsolete.


– SarMega 4 points 6 days ago +4 / -0

Or maybe just download them all and then torrent them and put the magnet link to the zip file here?


– deleteme1234 2 points 6 days ago +2 / -0

This ^


– WyoHighwayman 1 point 6 days ago +1 / -0

This isn't anywhere near all of them. They are digitizing and uploading daily for the foreseeable future.


– Archon69 4 points 6 days ago +4 / -0

If one person can download and prepare the data properly, they could put it on a torrent so people can click and get everything.


– Mister_Winston [S] 4 points 6 days ago +4 / -0

That is likely already happening somewhere. We just like to remain ahead of the normie curve.


– TaQo 3 points 6 days ago +3 / -0

Thanks for this!!!

u/#YouAreAmazing


– TopKek 3 points 6 days ago +3 / -0

This is awesome! Thank you!


– Kunkussion 3 points 6 days ago +3 / -0

Appreciate your hard work, I'll bookmark this post


– Godisglory1 3 points 6 days ago +3 / -0

Me too


– Deaf_MAGA_Pede 2 points 6 days ago +2 / -0

I've been trying to download everything with wget, but I kept getting a 404 error. I tried many different options and got nowhere, except that at one point I managed to start downloading the entire website. I had to terminate that before my HDD filled up, because wget was trying to download from other subdirectories and I'm sure it would have pulled down about 50 TB of files.

Since I'm on Linux, I guess I'll try curl.

But thank you for this; it will be great for others who are not familiar with wget/curl. I'm a bit miffed that I couldn't get wget to work. Or maybe I did it correctly and archives.gov is blocking wget?


– PhDinNY 1 point 6 days ago +1 / -0

Can someone who has done this please search for that death certificate/coroner's report that was posted the other day, the one that stated there was an entry wound from the front?


– lostmyeffingpassword 2 points 6 days ago +2 / -0

It doesn't say that, it says "Multiple gunshot wounds of the head and neck" and "Shot by a high powered rifle"

Multiple wounds from a high powered rifle? Hmmmm.....


– PhDinNY 1 point 5 days ago +1 / -0

I thought the common story was that Oswald was able to get two shots that hit JFK, but that people disputed he could have been able to do that with a bolt action rifle in the time required, so multiple gunshots would not be a new revelation?


– Mister_Winston [S] 1 point 6 days ago +1 / -0

I'll do that tonight or tomorrow. I dumped all the PDFs into PDF24 at the same time, and it's about halfway finished OCR'ing them.
