This script uses data from StackOverflow to show the relations between users and tags. Main dependencies:
- stackapi - crawling data
- graphistry - drawing data as a graph
- pyarrow, pandas and numpy - transforming data
TSVis gathers Q&A threads that carry the tags you provide. In the next step, it collects all users who answered in those threads. The gathered data is used to build a list of tags for each user: StackOverflow reports, for a given user, the tags they were active in, and we use that information to prepare a list of relations: user -> tag. In the final stage, those relations are drawn as a graph in Graphistry.
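The relation-building step described above can be sketched in a few lines. This is only an illustration, not the script's actual code: the function name build_user_tag_edges and both input dictionaries are hypothetical, the sample data is made up, and plain Python structures are used here where the real script relies on pandas.

```python
# Made-up sample data standing in for what the crawler would collect.
# Thread id -> users who answered in that thread (hypothetical values).
answerers_by_thread = {
    "q1": ["alice", "bob"],
    "q2": ["bob", "carol"],
}

# User -> tags that user was active in (hypothetical values).
tags_by_user = {
    "alice": ["bigdata", "hadoop"],
    "bob": ["bigdata", "apache-spark"],
    "carol": ["apache-spark"],
}


def build_user_tag_edges(answerers_by_thread, tags_by_user):
    """Return a list of (user, tag) relations for every user who answered."""
    # Deduplicate users who answered in more than one thread.
    users = sorted({user for users in answerers_by_thread.values() for user in users})
    # One edge per (user, tag) pair; users without tag data yield no edges.
    return [(user, tag) for user in users for tag in tags_by_user.get(user, [])]


edges = build_user_tag_edges(answerers_by_thread, tags_by_user)
for user, tag in edges:
    print(f"{user} -> {tag}")
```

An edge list of this shape (one source column, one destination column) is the kind of input a graph-drawing tool such as Graphistry works from.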
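You need at least Python 3.7 to run it. The script takes a list of tags separated by semicolons, for example --tags="bigdata;apache-spark" (quote the value so the shell does not treat the semicolon as a command separator).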
git clone git@github.com:data-hunters/tech-skills-visualizer.git
cd tech-skills-visualizer
pip3 install -r requirements.txt
This is just an example of running the script; it is a better idea to put the environment variables (especially GRAPHISTRY_PASSWD) in a file and load them from there.
export PYTHONPATH="${PYTHONPATH}:."
export GRAPHISTRY_USER=<GRAPHISTRY_USER>
export GRAPHISTRY_PASSWD=<GRAPHISTRY_PASSWORD>
# OPTIONAL: export STACK_API_KEY=<STACKEXCHANGE_API_KEY>
python3 tsvis/run.py --tags=bigdata --max-pages=10
Run python3 tsvis/run.py --help to check other options.
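As one possible way to keep credentials such as GRAPHISTRY_PASSWD out of your shell history, the variables can live in a file of KEY=VALUE lines that is read before the script needs them. The sketch below is an assumption, not part of TSVis: the helper name load_env_file and the file name graphistry.env are hypothetical, and the parser handles only simple lines (optional export prefix, optional quotes, # comments).

```python
import os
import tempfile


def load_env_file(path, environ=os.environ):
    """Read KEY=VALUE lines from `path` into `environ`; return the loaded pairs."""
    loaded = {}
    with open(path, encoding="utf-8") as handle:
        for raw_line in handle:
            line = raw_line.strip()
            # Skip blank lines and comments.
            if not line or line.startswith("#"):
                continue
            # Accept shell-style "export KEY=VALUE" lines as well.
            if line.startswith("export "):
                line = line[len("export "):]
            key, separator, value = line.partition("=")
            if not separator:
                continue  # Not a KEY=VALUE line; ignore it.
            # Drop surrounding quotes, if any.
            loaded[key.strip()] = value.strip().strip('"').strip("'")
    environ.update(loaded)
    return loaded


# Demo with a temporary file and a plain dict, so the real environment is untouched.
with tempfile.TemporaryDirectory() as tmp_dir:
    env_path = os.path.join(tmp_dir, "graphistry.env")  # hypothetical file name
    with open(env_path, "w", encoding="utf-8") as env_file:
        env_file.write("# placeholder credentials\n")
        env_file.write("export GRAPHISTRY_USER=demo-user\n")
        env_file.write("GRAPHISTRY_PASSWD='not-a-real-password'\n")
    demo_env = {}
    load_env_file(env_path, environ=demo_env)

print(sorted(demo_env))
```

Called with the default environ argument, the helper updates os.environ of the current Python process, so it would have to run in the same process as the script (for example from a small wrapper) before the credentials are read. Keep such a file out of version control.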