Skip to content

Latest commit

 

History

History
66 lines (37 loc) · 3.32 KB

README.md

File metadata and controls

66 lines (37 loc) · 3.32 KB

DocNet

Analysis on a network-model of a group of doctors (nodes) and their social/professional interactions (edges) over a period of time in 4 cities in the US. We attempt to use a varitety of methods from network theory to come to conclusions about how this group performed and behaved and run some very interesting simulations.

Summary of abilities

  • Demonstrates basic modelling of systems by networks for Data Science
  • Covers the implementation of community detection, centrality analyis and compnent/cluster detection in r
  • Shows how to use such methods to draw conclusions from a network
  • Demonstrates how to simulate the spread of information as a disease spreading through a network

Requirements and Installation

The entire project is done in R, can be found here.

Its best to use RStudio in my opinion as an IDE for R, it simply makes most tasks pretty easy and is the most popular IDE for R to the best of my knowledge. Available here.

And to generate the HTML files from the RMD, you'll need the Knitr library, documentation available here. Now the method I used to make use of the external R files relies on the source() function. If you really dont want to use this, then you can achieve the same thing via the read_chunk() function. If you want to knit the files this way, you may have to make small changes to the 5 .R files included. Simply add:

  • @knitr InfectedBetweennessNodes
  • @knitr InfectedJournalReaders
  • @knitr InfectedRandomNodes
  • @knitr SimulateEpidemic
  • @knitr CompSize-NodeRemoval

to the top of each file respectively. It should be fine to work then, and if not, refer to this site for information on how to use external .R files with knitr.

Usage

EDA

Community detection

Giant Component Analysis

Centrality and distruption

Simulating information spread

Conclusions

Contribution

Feel free to use the results or code for your own investigations, and if you want to make improvements or corrections then don't hesitate to let me know! Network theory and its applications are relatively new in mathematics so there is a lot to be done!

Credits

Dr. C. Colijn - Data-set and coursework idea itself was given to us as part of the M34A50 module at Imperial College London, lectured by Dr. Colijn.

License

MIT license, feel free to use and develop for your own investigations!