Skip to content

014. July 30 to Aug 3

aradu12 edited this page Aug 3, 2018 · 3 revisions

Planned tasks for this week

  • task 1: expand data using well-engineered projects ⌛️
  • task 2: work on paper and presentation

Progress

task 1

  • currently using the reaper dataset to find repos
    • a few repos have that same problem where there are lots of stars but few commits, but besides that it's promising
    • looking at those that were hits for the Random-Forest classifier
  • documenting the source as RepoReapers-dataset in yaml files
  • found a problem that fits under a new tag: stability determinism

task 2

  • didn't get to it this week

Other

  • Currently converting the list to yml format and adding the hit count for each repo -- not complete yet

Open Problems

  • new issues opened this week: NONE

Things we discussed/agreed on

  • stability -> determinism
  • getting at least 10 repos of each lang from RepoReaders-dataset
  • getting hit count for repos
  • limit RepoReapers-dataset to projects tagged by Random-Forest classifier

Next steps

  • finish data expansion
  • finish converting list of repos to yml format with hits
  • create on presentation draft
  • update paper