Here are three files (this link expires in 25 days; plz lemme know if it doesn't work). The .xlsx file has the raw data, the .txt file is the German ICD codes, and the .csv file contains the results, sorted in descending order by -log10(p-value).
If anyone knows what "code" and "nocode" means in the data, please reply.
Analysis steps:
Stack the data and add together the code and nocode numbers for each quarter and ICD-10 code, and then take log2 of this sum to standardize it.
Join in the German ICD-10 code definitions (they are a bit different from other ICD-10 versions).
Split the data into wide form with one row per quarter, creating 15,650 columns for each of the ICD-10-GM code definitions
Create a binary variable "covid19_vax" with value 0 for all quarters prior to 2021 and 1 for all 2021 and 2022 quarters.
Perform 15,650 t-tests with the binary variable and adjust for multiple testing using false discovery rate (FDR).
Screen for effects that are positive and significant according to FDR.
I'm using JMP software (Response Screening platform), which is way more convenient than Excel and great for graphics and stats without any coding.
The signals light up like a Christmas tree with hundreds of significant differences, including the R9* death codes in the OP plot.
Previous discussion on the German KBV data
Here are three files (this link expires in 25 days; plz lemme know if it doesn't work). The .xlsx file has the raw data, the .txt file is the German ICD codes, and the .csv file contains the results, sorted in descending order by -log10(p-value).
If anyone knows what "code" and "nocode" means in the data, please reply.
Analysis steps:
Stack the data and add together the code and nocode numbers for each quarter and ICD-10 code, and then take log2 of this sum to standardize it.
Join in the German ICD-10 code definitions (they are a bit different from other ICD-10 versions).
Split the data into wide form with one row per quarter, creating 15,650 columns for each of the ICD-10-GM code definitions
Create a binary variable "covid19_vax" with value 0 for all quarters prior to 2021 and 1 for all 2021 and 2022 quarters.
Perform 15,650 t-tests with the binary variable and adjust for multiple testing using false discovery rate (FDR).
Screen for effects that are positive and significant according to FDR.
I'm using JMP software (Response Screening platform), which is way more convenient than Excel and great for graphics and stats without any coding.
The signals light up like a Christmas tree with hundreds of significant differences, including the R9* death codes in the OP plot.