Aim

The purpose of the Phase 1 report is to choose a medical condition for your data analyses and to gain a basic understanding of that condition, such as its background, causes, treatments, status, and influential factors. Data analysis requires data, so you also need to find a dataset for your project. I strongly encourage you to use the Medical Expenditure Panel Survey (MEPS) data in this class, although I provide a list of other potential datasets below. Note that I will provide sample code only for the MEPS data. For the data analysis, you should have at least 500 data instances. Therefore, please choose a medical condition of interest and then check whether it has 500 data instances. More details are explained below.

Dataset

MEPS is a set of large-scale surveys of families and individuals, their medical providers, and employers across the United States. MEPS is the most complete source of data on the cost and use of health care and health insurance coverage in the US. For more details, please visit the official website and the GitHub repository, and read the attached appendix document (pp. 1-13). MEPS data include variables such as medical condition, socioeconomic factors (e.g., gender, region, race, and family income), and medical expenditure. MEPS data also come in several files: person-level (e.g., health status, demographics, and total cost of care), event-level (e.g., healthcare service use), and condition-level (e.g., medical condition). For a full review of those variables, please look at the codebooks (person-level and condition-level). I also coded those variables by usefulness for analysis (included vs. excluded, Heejun_Inclusion field) and variable type (independent vs.
dependent, Heejun_Variable_Type field). You can find my version of the codebook from this link. In particular, there are some dependent variables you can use:

- Total health expenditures
- Total inpatient expenditures
- Total emergency care expenditures
- Severity of Illness (attacks/year)
- Number of School Days Missed (Children)
- Number of Work Days Missed (Adult)

You should explore the dataset in depth to understand what you can do and to decide what you will do. It is a complex dataset, and you need to merge a number of files into one for your project. Do not feel overwhelmed. I will introduce all procedures step by step. Depending on the medical condition (e.g., allergic rhinitis), the research goals you can set include but are not limited to:

- Predict the yearly medical expenditure of persons with allergic rhinitis
- Compare healthcare costs across social determinant factors (e.g., sex, region, family income, and race)
- Find relationships between allergic rhinitis and environmental factors

If you are skilled in Python and want to use another dataset, that is up to your group. However, please note that I cannot fully support your group, and my sample code would need substantial changes to reflect the differences between the dataset you choose and the MEPS dataset. The following datasets are publicly available and free:

- National Health and Nutrition Examination Survey
- Early Childhood Longitudinal Studies Program
- Add Health
- FDA Adverse Event Reporting System (FAERS)

What to Do for the Phase 1 Report

1. Please find a medical condition that interests your group and find the ICD-9 code of the condition on FINDACODE.COM.
2. If your group decides to use the MEPS data, go to #3.
   If not, go to #5.
3. Download a condition-level file (h128.csv), open the file with MS Excel, and apply a filter to the “ICD9CODX” field. Select the ICD-9 code of your condition (e.g., 493 for asthma) and check the number of records (i.e., number of rows selected) related to your condition. This number will include duplicated records.
4. To remove duplicated records and count the number of unique records:
   A. Select the “DUPERSID” field and copy all selected DUPERSIDs. While copying, you should copy visible cells only. Otherwise, you will copy all DUPERSIDs in the column. To copy “visible cells only”, please follow the instructions found at Microsoft Support.
   B. The DUPERSID uniquely identifies each person represented in the file. In other words, DUPERSID is a sample person identifier.
   C. Then, paste the copied DUPERSIDs into another sheet (+ menu at the bottom).
   D. Remove duplicates (“Data” -> “Remove Duplicates”) and count the number of records (i.e., DUPERSIDs). If the number of DUPERSIDs is larger than 500, you are good. Congratulations! Your medical condition meets the requirement of the project. If not, you do not have enough data for the project. Unfortunately, you need to find another medical condition and repeat steps 3 to 4.D until you find a condition with more than 500 records.
5. If you selected another dataset, it is your responsibility to carry out steps 3 to 4.D by yourself. Your workflow should follow the same procedures as my suggested workflow.

How to write

Please write a report of 600-800 words (12-point font, double-spaced) and cite appropriately. You need references to introduce the background of your condition. In-text citations and the list of references should follow APA 7th edition style. The report should consist of 4 to 5 paragraphs: introduction, two or three paragraphs of the main body, and conclusion.
The topics can include but are not limited to background, causes of the disease, treatments, status, influential factors, and the particular reason for your interest.

What to include

Your submission of the report should include:

- Names of students in your project group (up to three students)
- EUIDs of group members
- Title of the project
- Medical condition
- Data source
- Number of data records
- What I asked for in “How to write”
- The order of contributions (i.e., work distribution), like authorship (e.g., first author, second author, third author, and so on)
  - If you believe that some or all of you contributed equally, then you need to state it
  - Here, contributions include all phases of the work you did to submit this project report

Please attach a .docx file. All group members should submit the same copy of the report.

Grading Criteria

Your reports will be evaluated on both the quality and quantity of entries. By quality, I do not mean mechanics (e.g., spelling, grammar), but rather content. Your entries should demonstrate reflective and critical thinking, integration of materials and experiences, development of your own research questions, and general intellectual growth. As to length or quantity, 600-800 words and 4 to 5 paragraphs are required. The list of references is not included in the word count.
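
The record-count check in steps 3 and 4 of "What to Do for the Phase 1 Report" (filter by ICD-9 code, then count unique DUPERSIDs) can also be done in Python instead of Excel. The following is only a minimal sketch using the standard library, not the instructor's sample code. It assumes the CSV has header columns named ICD9CODX and DUPERSID, as the steps describe, and that codes are stored as text such as "493". The function names are hypothetical. The threshold check uses "at least 500" from the Aim section; the step list says "larger than 500", so treat the exact boundary as an assumption.

```python
import csv

MIN_PERSONS = 500  # minimum number of data instances stated in the assignment


def read_condition_rows(csv_path):
    """Yield each row of a condition-level CSV as a dict keyed by column name."""
    with open(csv_path, newline="") as handle:
        yield from csv.DictReader(handle)


def count_unique_persons(rows, icd9_code, code_field="ICD9CODX", id_field="DUPERSID"):
    """Return (matching_rows, unique_persons) for one ICD-9 code.

    matching_rows corresponds to the filtered row count in step 3 (duplicates
    included); unique_persons corresponds to the count after removing
    duplicate DUPERSIDs in step 4.
    """
    target = str(icd9_code).strip()
    matching_rows = 0
    person_ids = set()
    for row in rows:
        # Assumption: codes are plain text; strip any padding before comparing.
        if str(row[code_field]).strip() == target:
            matching_rows += 1
            person_ids.add(str(row[id_field]).strip())
    return matching_rows, len(person_ids)


def meets_requirement(unique_persons, minimum=MIN_PERSONS):
    """True when the condition has at least `minimum` unique persons."""
    return unique_persons >= minimum
```

With h128.csv in the working directory, `count_unique_persons(read_condition_rows("h128.csv"), "493")` would return the row count and the unique-person count for asthma, and passing the second number to `meets_requirement` reproduces the check in step 4.D. If your copy of the file stores codes differently (for example with quotes or leading zeros), adjust the comparison accordingly.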
