
[WIP] Process bibtex #32

Draft
wants to merge 34 commits into base: process_reference

Conversation

ChasNelson1990

I've started to add the ability to use bibtex_keys and process them as part of @dalonsoa's doi & process_reference changes.

Currently this is not fully merged with @dalonsoa's ideas - I'm still trying to work out the best way to do this, because his approach would, I think, mean parsing my .bib file on every call of the processor classes (since the class is reinitialised inside process_reference()), which I want to avoid.

Any suggestions/comments/commits welcome.

@ChasNelson1990
Author

Thinking about this a bit more... I think @dalonsoa's DOI processing thoughts and my bibtex processing thoughts might both need rethinking to be compatible. Here is my understanding of the competing constraints:

  1. when working with DOIs, each new DOI requires a request to, for example, a crossref endpoint (e.g. the bibtex endpoint); this means we call our processor object once for each entry in a BIBLIOGRAPHY
  2. thus, we should cache these individual responses so we only ever make one call to the internet per DOI (currently this is done with @lru_cache)
  3. when working with bibtex, we only want to parse the .bib file once (and thus do not need to cache individual responses)
  4. the obvious way to do this is to store the parsed bib inside the processor object; however, that means we should initialise the object once for a whole BIBLIOGRAPHY
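For point 2, a minimal sketch of the @lru_cache behaviour (the real function would make a network request to the crossref bibtex endpoint; here a call counter stands in for it, just to show that repeated DOIs are served from the cache):

```python
from functools import lru_cache

calls = {"n": 0}

@lru_cache(maxsize=None)
def fetch_doi(doi: str) -> str:
    # Stand-in for the real network request to a crossref-style endpoint;
    # the counter demonstrates that repeat calls never reach the "network".
    calls["n"] += 1
    return f"@misc{{{doi}, note = {{fetched record}}}}"

fetch_doi("10.1000/x")
fetch_doi("10.1000/x")  # cache hit: the body does not run again
assert calls["n"] == 1
```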

Possible solution: instead of using lru_cache, we store each DOI response in a self.cache dictionary within the processor object (check there first, call crossref only if it's not there); then, for the bibtex case, we just store the parsed bibtex in self.cache instead?
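A rough sketch of that idea, with the network request stubbed out and all class/method names being assumptions rather than the actual r2t2 API:

```python
class DOIProcessor:
    """Per-instance cache replacing @lru_cache: check the dict first,
    only hit crossref (stubbed out below) on a miss."""

    def __init__(self):
        self.cache = {}
        self.network_calls = 0  # instrumentation only, to show caching works

    def _fetch(self, doi):
        self.network_calls += 1  # stand-in for the real crossref request
        return f"@misc{{{doi}}}"

    def process(self, doi):
        if doi not in self.cache:                  # check the cache first
            self.cache[doi] = self._fetch(doi)     # call crossref only on a miss
        return self.cache[doi]


class BibtexProcessor:
    """For bibtex, self.cache simply holds the whole .bib file,
    parsed once at initialisation."""

    def __init__(self, parsed_bib):
        self.cache = dict(parsed_bib)  # {bibtex_key: entry}

    def process(self, key):
        return self.cache[key]
```

Both classes then share the same process() interface, so the caller doesn't care which caching strategy is in play.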

@ChasNelson1990
Author

Then, thinking about the big picture: one would just call PROCESSOR[args.processor](output_format=args.something) once in __main__.py (before the writer), and the class would initialise, run through all BIBLIOGRAPHY.values(), and replace '[doi]something' or '[bibtex]something' with 'My output_formatted string'?
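The single-sweep idea could look roughly like this; the data shape ({checkpoint: [reference strings]}) and the Formatted(...) output are illustrative assumptions, not the real r2t2 data model:

```python
def process_bibliography(bibliography, output_format="plain"):
    """One pass over the whole BIBLIOGRAPHY, replacing tagged references
    with their formatted strings. Shapes and names are assumptions."""

    def fmt(ref):
        if ref.startswith("[doi]"):
            return f"Formatted({ref[5:]})"   # stand-in for a crossref lookup
        if ref.startswith("[bibtex]"):
            return f"Formatted({ref[8:]})"   # stand-in for a .bib lookup
        return ref                           # untagged references pass through

    return {k: [fmt(r) for r in refs] for k, refs in bibliography.items()}
```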

@ChasNelson1990
Author

Of course, the other way to solve this is not to use a ProcessorBase at all but rather to mirror what has been done in writers.py and just have a defined function for each registered processor.
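The writers.py-style alternative would be a decorator-based registry of plain functions, something like the following (registry name, decorator, and signatures are all assumptions about what mirroring writers.py would look like):

```python
# Registry of processor functions keyed by name, mirroring a
# register-by-decorator pattern rather than a ProcessorBase class.
PROCESSORS = {}

def register_processor(name):
    def decorator(func):
        PROCESSORS[name] = func
        return func
    return decorator

@register_processor("doi")
def process_doi(reference, cache={}):
    # The mutable default acts as a module-level cache shared across calls,
    # replacing the per-instance self.cache of the class-based design.
    if reference not in cache:
        cache[reference] = f"resolved:{reference}"  # stand-in for crossref
    return cache[reference]
```

The trade-off is that shared state (the cache, or a parsed .bib) has to live at module level or in a closure instead of on an object.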

@ChasNelson1990
Author

Also, I realise I have currently set up the args in __main__.py to assume that we only want a single processor - which is of course not always going to be the case. I'll try to fix that now.
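One way to allow multiple processors is argparse's action="append", so the flag can be repeated on the command line (the --processor flag name is an assumption about what __main__.py uses):

```python
import argparse

parser = argparse.ArgumentParser()
# action="append" collects repeated flags into a list; with default=None,
# argparse starts a fresh list on the first occurrence.
parser.add_argument("--processor", action="append", default=None,
                    help="processor to run; may be given multiple times")

args = parser.parse_args(["--processor", "doi", "--processor", "bibtex"])
assert args.processor == ["doi", "bibtex"]
```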

@codecov

codecov bot commented Sep 2, 2020

Codecov Report

Merging #32 into process_reference will increase coverage by 20.73%.
The diff coverage is 23.61%.


@@                  Coverage Diff                   @@
##           process_reference      #32       +/-   ##
======================================================
+ Coverage              29.26%   50.00%   +20.73%     
======================================================
  Files                      6        6               
  Lines                    164      224       +60     
======================================================
+ Hits                      48      112       +64     
+ Misses                   116      112        -4     
Impacted Files Coverage Δ
r2t2/__main__.py 0.00% <0.00%> (ø)
r2t2/reference_processors.py 0.00% <0.00%> (ø)
r2t2/core.py 96.42% <88.88%> (-1.45%) ⬇️
r2t2/static_parser.py 100.00% <100.00%> (+100.00%) ⬆️
r2t2/__init__.py
r2t2/writers.py 55.55% <0.00%> (+55.55%) ⬆️


Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b4620ed...4b8d3c6. Read the comment docs.
