This study presents a comprehensive analysis of citation networks, focusing on publications by authors affiliated with Egyptian institutions. Using the Semantic Scholar platform and its API, a citation network graph and a co-authorship network graph are constructed to visualize the interconnections among these publications and their authors, mainly with the Python graph-analysis package NetworkX. The primary objective is to identify influential Egyptian publications and assess the centrality of nodes within the citation network. The findings offer insight into the impact of Egyptian publications and the scholarly influence of authors associated with Egyptian institutions. Researchers and academics interested in evaluating that impact can use the data for future studies, collaborations, and policy decisions.
Note:
Because the files are very large, the dataset is available through this link: AlGoNet. All the code for web scraping and cleaning the dataset is available at this Drive Link.
Data on the Egyptian papers was collected by sending requests to the Semantic Scholar API, using an API key to obtain a higher request rate. There are two main stages:
- The dataset provides details about researchers' names and affiliations.
- The dataset comprises information on 13,027 researchers, including both Egyptians and individuals affiliated with Egyptian universities.
- Scraping was done using the Python library Selenium.
- Data was collected on researchers affiliated with the following universities: Ain Shams University, Cairo University, EJUST University, Alexandria University, Banha University, Assiut University, and Zewail University.
- Final dataset: 31,508 research papers.
- Each paper includes at least one researcher from Egypt or affiliated with an Egyptian university.
- Paper details: title, unique paper ID, publication year, the list of authors' names and their Semantic Scholar IDs, citation count, and the list of all references with their titles and corresponding IDs.
- API request limitations:
  - The Semantic Scholar API key allows up to 100 requests per second.
  - The API returns data for at most 500 authors per paper.
  - The API returns at most 1000 references per paper.
- Check Semantic Scholar for any updated information regarding API requests.
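As a rough illustration of the paper-collection step, the sketch below builds a request to the Semantic Scholar Graph API for a single paper and flattens the JSON response into the fields listed above. It is not the code used to build AlGoNet: the helper names (`build_request`, `parse_paper`, `fetch_paper`), the record layout, and the exact `fields` selection are assumptions made for this example, and only the standard library is used.

```python
import json
import urllib.parse
import urllib.request

API_ROOT = "https://api.semanticscholar.org/graph/v1"
# Assumed field selection, chosen to match the paper details listed above.
FIELDS = "title,year,citationCount,authors,references.paperId,references.title"


def build_request(paper_id, api_key=None):
    """Build the GET request for one paper (hypothetical helper)."""
    query = urllib.parse.urlencode({"fields": FIELDS})
    url = f"{API_ROOT}/paper/{paper_id}?{query}"
    # The API key, when available, is sent in the x-api-key header.
    headers = {"x-api-key": api_key} if api_key else {}
    return urllib.request.Request(url, headers=headers)


def parse_paper(payload):
    """Flatten an API response into a plain record (assumed layout)."""
    return {
        "paper_id": payload.get("paperId"),
        "title": payload.get("title"),
        "year": payload.get("year"),
        "citation_count": payload.get("citationCount", 0),
        "authors": [
            (a.get("authorId"), a.get("name"))
            for a in payload.get("authors") or []
        ],
        "references": [
            (r.get("paperId"), r.get("title"))
            for r in payload.get("references") or []
        ],
    }


def fetch_paper(paper_id, api_key=None, timeout=30):
    """Download and parse one paper; performs a network call."""
    request = build_request(paper_id, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_paper(json.load(response))
```

Calling `fetch_paper("<paper id>", api_key="<your key>")` would return one record per paper. A real collection script would also need to pace its requests to stay within the rate limit and handle failed responses, which this sketch leaves out.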
The following studies were conducted and illustrated in our paper:
- Average Degree Centrality for all collected papers
- Degree Centrality for Publications (>2015) [O(V+E)]
- Eigenvector Centrality for Publications (>2015) [O(V*(V+E))]
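The measures listed above can be sketched with NetworkX on a toy citation graph. The paper IDs, years, and edges below are invented for illustration and are not taken from AlGoNet. Computing eigenvector centrality on an undirected view of the graph is an assumption of this sketch (it keeps the power iteration well behaved on a tiny acyclic example) and is not necessarily how the analysis in the paper was set up.

```python
import networkx as nx

# Toy citation graph: an edge (a, b) means "paper a cites paper b".
# IDs and years are made up for this example.
G = nx.DiGraph()
G.add_nodes_from([
    ("P1", {"year": 2016}),
    ("P2", {"year": 2018}),
    ("P3", {"year": 2019}),
    ("P4", {"year": 2020}),
    ("P5", {"year": 2012}),
])
G.add_edges_from([
    ("P2", "P1"), ("P3", "P1"), ("P4", "P1"),
    ("P3", "P2"), ("P1", "P5"),
])

# Average degree centrality over all collected papers.
all_degree = nx.degree_centrality(G)
avg_degree = sum(all_degree.values()) / G.number_of_nodes()

# Keep only publications after 2015.
recent_nodes = [n for n, year in G.nodes(data="year") if year > 2015]
recent = G.subgraph(recent_nodes)

# Degree centrality: (in-degree + out-degree) / (n - 1) for each node.
degree = nx.degree_centrality(recent)

# Eigenvector centrality on an undirected copy (assumption of this sketch).
eigen = nx.eigenvector_centrality(recent.to_undirected(), max_iter=1000)

# Rank the recent papers by eigenvector centrality, highest first.
ranking = sorted(eigen, key=eigen.get, reverse=True)
```

In this toy graph `P1` is cited by every other recent paper, so it ranks first under both measures.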
You can explore all the findings and outcomes, as well as our interpretation, in our paper or by running the code in the .ipynb file. All of the supplements are attached to this repo.
We would like to express our deepest gratitude to Eng. Zeyad Shokry for his invaluable guidance and assistance with the collection of our dataset (AlGoNet).
[Mariam Ayman - Sohaila Kandil - Youssef El-harty - Ahmed Hagag - Omar Wassim - Alaa Moheb]