What is the optimal data entry coding scheme of ICD-10 codes that is seemlessly compatible with major statistical packages?

05 May 2018 0 1K Report

Dear researchgate community.

We are currently undertaking a large historical cohort study (N = 3500 records), where we will, among other things, register all hospital admissions and the 3 first ICD-10 codes from the medical reports.

Data entry will be done in SPSS, but several later analyses will be done in Stata/R and others (genetic platforms). The coding scheme in ICD-10 starts with a letter (i for cardiovascular), a main number, and final numbers; I50 is heart failure, I50.9 is unspecified heart failure. This can perhaps most easily be coded as a string variable, as it is. However, this string code might perhaps generate complications later on, moving between statistical platforms, converting files, etc? A more elaborate approach could be to generate a 2-3 variable coding scheme for the letter, the main number, and the sub-number, where all could be in numeric form (a = 1, b = 2, etc). However, this would put some burden on the persons doing the data entry (including me). Does anyone have any experience with data entry coding schemes, using ICD-10 codes in semi-big cohorts and the potential pain they might cause later, moving across statistical platforms?

Greatful for any input

Lasse

Badges
Science topic

More Lasse Giil's questions See All

Orthogonal polynomials to SPSS?

I am developing a Growth model using either GEE (gamma) or generalized mixed effect model(gamma) for psychiatric symptoms in Alzheimer with 6 measurments over 6 consecutive years. There are...

06 July 2016 7,234 4 View

Biostatistics company? If you have issues developing a model, is there an internationally operating statistics company for scientists you can pay for?

With survivable rates for a small and new Research group.

05 June 2016 1,846 0 View

How can I get functional analyses of antibodies to GPCR for a good price?

Dear all. We are studying the pathophysiology og autoantibodies to GPCR that we induce in animal models. In addition to looking at the histological endpoints, we would like to purify the IgG from...

03 April 2015 8,228 2 View

How to learn more about SPSS and its Application?

I would like to learn more about SPSS and Its application especially in regards to data analysis. Please suggest me how I can learn more about it. Thank you so much.

11 August 2024 9,101 4 View

Handling Missing Data and Building a Predictive Model with Incomplete Information ?

I am developing a predictive model for a water supply network that involves 20 influencing points. However, I only have historical data for 10 out of these 20 points. I would like to know how to...

10 August 2024 4,005 2 View

Has anyone applied Python in the field of textile engineering for data analysis, automation, or smart textiles?

I'm currently exploring the application of Python in textile engineering, specifically in areas like data analysis, process automation, and the development of smart textiles. I'm interested in...

10 August 2024 7,429 2 View

What may be the reasons for failures of Tube toi Tube Sheet Joints in Boiler Drum ?

We have observed that tube to tube sheet joint leaked in our boiler and needs to overcome same by knowing the root cause.

08 August 2024 3,161 0 View

Is an invitation to join the editorial board of Clinical Cardiology Updates a scam?

I received an e-mail invitation to join the editorial board of Clinical Cardiology Updates. While I have published a few articles related to cardiovascular disease, there are lots of colleagues...

06 August 2024 8,981 8 View

Is Galaxy.org good to use for research for analyzing data and for publication?

Hello all, I wanted to know, can I use galaxy (USA, Europe or Australia) platform for analyzing the shotgun data, and can it be used for publication purpose as well? Thanks :)

06 August 2024 6,610 4 View

Do experts have journals in the field of artificial intelligence and big data that are not indexed by SCI or EI?

05 August 2024 8,836 2 View

If we are using snowball sampling technique, how do we justify the true representativeness of the sample statistically? is there any statistical test?

Are there any statistical methods to justify your sampling technique using SPSS or AMOS?

05 August 2024 9,153 4 View

What are possible strategies can be used to analyze data under sequential explanatory mixed method approach?

Better ways to analyze the qualitative and quantitative data in a sequential explanatory mixed method approaches

04 August 2024 2,703 6 View

How can I interpret the data without the need of solving it manually?

How can I interpret the data gathered without solving?

03 August 2024 9,054 3 View