ARXIVNG-2885: Implement support for TeX Produced check #77

Open
wants to merge 23 commits into
base: develop
Changes from 18 commits
8ab8ea9
Resolved merge conflicts
DavidLFielding Feb 25, 2020
5f20d51
Tests for tex_produced package. [ARXIVNG-2885] David
DavidLFielding Feb 27, 2020
34abcff
Test files for tex_produced test suite. [ARXIVNG-2885] David
DavidLFielding Feb 27, 2020
647be11
Check for TeX produced content files. [ARXIVNG-2885] David
DavidLFielding Feb 27, 2020
386aa4b
Package indicator for checks package.
DavidLFielding Feb 27, 2020
1e8242b
Cleaned up cruft. [ARXIVNG-2885] David
DavidLFielding Feb 27, 2020
1075548
Add PyPDF2 package for PDF support routines needed for TeX-Produced c…
DavidLFielding Feb 28, 2020
99b5aa2
Cleanup. Fixed Postscript TeX-produced check. Added comments. [ARXIVN…
DavidLFielding Mar 3, 2020
04029fc
Generalized path to test files. [ARXIVNG-2885] David
DavidLFielding Mar 3, 2020
7cea3a2
Improved error checking. Improved Postscript TeX-produced message. [A…
DavidLFielding Mar 3, 2020
0242e31
Try without files tests that require poplar tools.
DavidLFielding Mar 10, 2020
28eb55b
Eliminated tests that rely on Poplar PDF utilities and file paths. [A…
DavidLFielding Mar 10, 2020
59c3471
Minor change. Forgot to remove one file-based test. [ARXIVNG-2885] David
DavidLFielding Mar 10, 2020
66b48a6
Removed file-based checks.
DavidLFielding Mar 10, 2020
dd71604
Removed import for file-based check.
DavidLFielding Mar 10, 2020
aaf1a8a
Added TODO for converting subprocess to python.
DavidLFielding Mar 11, 2020
f2e4a47
Changed package file name.
DavidLFielding Mar 11, 2020
1f7e855
Added size and time limits to preview content read. [ARXIVNG-2885] David
DavidLFielding Mar 11, 2020
85196fa
Added/cleaned up typing.
DavidLFielding Mar 16, 2020
73ef90e
Cleaned/fixed up typing.
DavidLFielding Mar 16, 2020
c6a6731
Added exception and fixed bug.
DavidLFielding Mar 17, 2020
50b9b7c
Merge branch 'develop' into ARXIVNG-2885
DavidLFielding Mar 17, 2020
9b54a9d
Fix bug with identifying TeXmacs PDFs. [ARXIVNG-4714] David
DavidLFielding Nov 22, 2021
3 changes: 2 additions & 1 deletion Pipfile
@@ -31,8 +31,9 @@ Jinja2 = ">=2.10.1"
pyyaml = ">=4.2b1"
arxiv-vault = "~=0.1.1rc15"
redis = "==2.10.6"
arxiv-submission-core = {path = "./core"}
mypy = "==0.720"
pypdf2 = "==1.26.0"


[dev-packages]
"nose2" = "*"
42 changes: 23 additions & 19 deletions Pipfile.lock


2 changes: 1 addition & 1 deletion agent/agent/consumer.py
@@ -29,7 +29,7 @@
3. The agent evaluates the event against registered :class:`.Rule` instances,
using :func:`.rules.evaluate`. A :class:`.Rule` maps a condition (the event
type and event/submission properties) to a :class:`.Process`.
-4. The agent dispatches any triggered :class:`.Proccess` instances to the
+4. The agent dispatches any triggered :class:`.Process` instances to the
:mod:`agent.worker` using the :class:`.AsyncProcessRunner`.


2 changes: 2 additions & 0 deletions core/arxiv/submission/process/checks/__init__.py
@@ -0,0 +1,2 @@
"""Core submission process checks."""

Binary file not shown.
@@ -0,0 +1,51 @@
These test submissions exercise the check for TeX-produced
PostScript and PDF files.

Most test files are either TeX-produced PS/PDF files or
ordinary non-TeX-produced files.

PDF test files:

0611002.pdf
0706.3810.pdf - Not TeX produced
0706.3906.pdf
0706.3927.pdf
0706.3971.pdf - Not TeX produced
0706.4328.pdf
0706.4412.pdf
2738685LaTeX.pdf
2745765withCairoFonts.pdf - New: looks for Cairo fonts
2748220withCairoCreator.pdf - New: looks for Cairo software
GalluzziBalkancom2018.pdf
astro-ph-0610480.ethanneil.20289.pdf
astro-ph-0703077.jf_sauvage.10062.pdf
astro-ph.arimoto.4168.pdf
astro-ph.ewhelan.18488.pdf
math0607661.tudateru.25992.tsuda_takenawa.pdf
notex_compositionality.pdf - Not TeX produced
sparsemult6.pdf

Postscript test files:

0190238.ps
astro-ph.fdarcang.22633.ps
hep-th-0701130.pmho.24929.ps
math.kristaly.24457.ps
math.suri.13734.ps
notex_kkpants.eps - Not TeX produced
notex_orddps5.eps - Not TeX produced
physics-0611280.pdomokos.2059.eps
simple_tex_produced.ps
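
As a rough illustration of the kind of check these files exercise, the sketch below scans raw file bytes for tool names that TeX toolchains commonly write into PostScript `%%Creator:` comments and PDF `/Creator` or `/Producer` entries. It is a minimal sketch under stated assumptions, not the implementation in this PR: the function name `looks_tex_produced`, the marker list, and the 64 KiB header limit are all hypothetical, and it will miss PDF metadata stored in compressed streams.

```python
import re

# Assumed marker list, for illustration only; not taken from the PR's code.
# Matching is case-sensitive so that ordinary words such as "text" do not match.
# Note: this naive pattern also matches "TeXmacs".
TEX_TOOLS = rb"(?:dvips|dvipdfm|TeX)"

# PostScript: DSC header comment such as "%%Creator: dvips(k) 5.98 ..."
PS_CREATOR = re.compile(rb"^%%Creator:[^\r\n]*" + TEX_TOOLS, re.MULTILINE)

# PDF: uncompressed Info entries such as "/Producer (pdfTeX-1.40.21)"
PDF_INFO = re.compile(rb"/(?:Creator|Producer)\s*\([^)]*" + TEX_TOOLS)


def looks_tex_produced(data: bytes, header_limit: int = 65536) -> bool:
    """Return True if the raw bytes carry an obvious TeX toolchain marker."""
    head = data[:header_limit]
    if head.startswith(b"%PDF-"):
        # The Info dictionary may sit anywhere in the file, so scan all of it.
        return PDF_INFO.search(data) is not None
    if head.startswith(b"%!"):
        # DSC comments live in the PostScript header, so the head is enough.
        return PS_CREATOR.search(head) is not None
    return False


if __name__ == "__main__":
    sample = b"%!PS-Adobe-2.0\n%%Creator: dvips(k) 5.98\n%%Title: paper.dvi\n"
    print(looks_tex_produced(sample))
```

A byte-level scan like this keeps the sketch dependency-free; a real check would more likely parse the PDF metadata with a library and handle the Cairo cases noted above separately.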



The legacy tests contain a small number of test files that
produced false negative and false positive results.

These files show where this check fails to clearly
distinguish TeX-produced from non-TeX-produced files.

0609584.pdf - TeX produced but not detected as such
paperfinal.PDF - Not TeX produced but detected as TeX produced
submit_0169105.ps - TeX produced but not detected as such
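
The three legacy outcomes listed above can be tallied with a small helper. This is a sketch for illustration only: the function name `classify_outcomes` and the dictionary layout are hypothetical, and the expected/detected values are simply those stated in the list above.

```python
from typing import Dict, List, Tuple


def classify_outcomes(
    expected: Dict[str, bool], detected: Dict[str, bool]
) -> Tuple[List[str], List[str]]:
    """Return (false_positives, false_negatives) as sorted file-name lists.

    expected[name] is True when the file really is TeX produced;
    detected[name] is True when the check reported it as TeX produced.
    """
    false_pos = sorted(
        name for name, is_tex in expected.items() if not is_tex and detected[name]
    )
    false_neg = sorted(
        name for name, is_tex in expected.items() if is_tex and not detected[name]
    )
    return false_pos, false_neg


# Values taken from the legacy file list above.
expected = {"0609584.pdf": True, "paperfinal.PDF": False, "submit_0169105.ps": True}
detected = {"0609584.pdf": False, "paperfinal.PDF": True, "submit_0169105.ps": False}

false_pos, false_neg = classify_outcomes(expected, detected)
print("false positives:", false_pos)
print("false negatives:", false_neg)
```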

Binary file not shown.