Independent Project: Simplifying and optimizing Kang (2020) with GeoPandas
Simplifying and optimizing Kang (2020) with GeoPandas
Published in 2020, Kang et al. measured the spatial accesibilty of COVID-19 resources in Illinois. A workflow I made for my replication of their study is shown below.
My contributions to the study:
Making code more efficient and easier to read with GeoPandas
Using native GeoPandas methods I improved two functions, pop_centroid and hospital_setting cutting down 5 minutes of combined processing time to less than a second.
Simplifying code for future students
- Removed parallel processing from overlapping_function, measure_acc_par
- Removed the dropdown option that allowed the user to choose their population type, hospitals, and resource. This customization was not providing value other than to show that changes made did not make much difference. It also made code much less readable, and more likely for inconsistency in a replication.
- Wrote detailed comments in every function, describing data structures, methods, and reasoning
- Removed of unneccessary codeblocks
My pre-registration of my reanalysis can be found here.
Our updated paper can be found at this GitHub Repository. You can view it as a HTML Document here.