less than 1 minute read

Simplifying and optimizing Kang (2020) with GeoPandas

Published in 2020, Kang et al. measured the spatial accesibilty of COVID-19 resources in Illinois. A workflow I made for my replication of their study is shown below.

workflow

My contributions to the study:

Making code more efficient and easier to read with GeoPandas

Using native GeoPandas methods I improved two functions, pop_centroid and hospital_setting cutting down 5 minutes of combined processing time to less than a second.

Simplifying code for future students

  • Removed parallel processing from overlapping_function, measure_acc_par
  • Removed the dropdown option that allowed the user to choose their population type, hospitals, and resource. This customization was not providing value other than to show that changes made did not make much difference. It also made code much less readable, and more likely for inconsistency in a replication.
  • Wrote detailed comments in every function, describing data structures, methods, and reasoning
  • Removed of unneccessary codeblocks

My pre-registration of my reanalysis can be found here.

Our updated paper can be found at this GitHub Repository. You can view it as a HTML Document here.