Input DataSets:

Project:

Ego-splitting Framework in C++

Project Type:

Implementation-flavor

Paper Link:

https://dl.acm.org/doi/10.1145/3097983.3098054

Code Implementation :

C++, R

Input DataSets:

Data sets are provided int the Inputs directory we have tested our source code on 4 different datasets

a. Test

This is small graph authors have used to explain the framework in the research paper. Graph with 9 nodes and 11 edges.

b. Tv_show edges

Data collected about Facebook pages (November 2017). These datasets represent blue verified Facebook page networks of different categories. Nodes represent the pages and edges are mutual likes among them.

c. DBLP

The DBLP computer science bibliography provides a comprehensive list of research papers in computer science. We construct a co-authorship network where two authors are connected if they publish at least one paper together. Publication venue, e.g, journal or conference, defines an individual ground-truth community; authors who published to a certain journal or conference form a community. We regard each connected component in a group as a separate ground-truth community. We remove the ground-truth communities which have less than 3 nodes. We also provide the top 5,000 communities with highest quality which are described in our paper. As for the network, we provide the largest connected component.

tester_edges

This is small graph authors have used to explain the framework in the research paper. Graph with 10 nodes and 28 edges.

Code Execution

we have implemented our project in C++ from scratch and we have not used any complex libraries any cpp compiler with right configuration would be able to run our script.

please follow the below steps before executing the script

Step1:

verify your G++ version, download it from https://formulae.brew.sh/formula/gcc or for Vs-code https://code.visualstudio.com/docs/languages/cpp

step2:

Set correct Input file path to "file_name" variable in the main function in ego_split.cpp file in order to execute our algorithm on the provided data set and plot get the desired splitted ego's for each node

step3:

Execute the script and save the output to log file as below command

g++ /path/to/file/ego_split.cpp -o /path/to/file/ego_split && /path/to/file/ego_split >> /path/to/log_file/tester_edges.log

Here is a sample output file which will be generated on successful execution of our algorithm

We have uploaded our exectionlogs on the above 4 data sets in the outputs folder

Evaluation of our ego-splitting algorithm

We have uploade a Evalauation.ipynd file in our project. Our evaluation foloows the follow steps,

Step 1:

As our source code is completly developed in C++ we have created some modules in python and pasted our final Overlapping Partitions outupt from logfile to our Ipynb file and the generated communities from it.

Step 2:

Then we have generated communities for our Overlapping partitions generated from each dataset using the below function

Here is a sample output for the communities generated for Tester_edges data set

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Inputs		Inputs
Outputs		Outputs
R-Implementation		R-Implementation
Evaluation.ipynb		Evaluation.ipynb
README.md		README.md
ego_split.cpp		ego_split.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project:

Project Type:

Paper Link:

Code Implementation :

Input DataSets:

a. Test

b. Tv_show edges

c. DBLP

tester_edges

Code Execution

please follow the below steps before executing the script

Step1:

step2:

step3:

Here is a sample output file which will be generated on successful execution of our algorithm

Evaluation of our ego-splitting algorithm

Step 1:

Step 2:

Step 3: For caliculating F1 scores we have used the following algorithms proved in nf1 and cdlib packages of python

About

Releases

Packages

Languages

sairajindroju/Ego-splitting-Framework-from-Non-Overlapping-to-Overlapping-Clusters

Folders and files

Latest commit

History

Repository files navigation

Project:

Project Type:

Paper Link:

Code Implementation :

Input DataSets:

a. Test

b. Tv_show edges

c. DBLP

tester_edges

Code Execution

please follow the below steps before executing the script

Step1:

step2:

step3:

Here is a sample output file which will be generated on successful execution of our algorithm

Evaluation of our ego-splitting algorithm

Step 1:

Step 2:

Step 3: For caliculating F1 scores we have used the following algorithms proved in nf1 and cdlib packages of python

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages