Infuse

This project aims to create a PDF-processing Rust library, à la Grobid, which can be used to read scientific PDFs as if they were normal web pages. The library will then be integrated into a web app by compiling it to Wasm.

The implementation is still embryonic, but there is an interesting presentation (37-minute talk, 18 minutes of questions) and associated slides!

Status

Reading PDFs works, including in the browser.

Current work is focused on piecing together the various objects encoded in the PDF in order to reconstruct the tree of content, including the full body text, while also classifying those pieces into the types we're interested in (footnote, caption, metadata, body, ...).
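To make the goal above more concrete, here is a minimal sketch of what a classified content tree could look like. It is not the library's actual data model: the names `Kind`, `Node` and `collect` are hypothetical, and the variants only mirror the types listed above.

```rust
/// Hypothetical content categories, mirroring the types named above.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Kind {
    Body,
    Footnote,
    Caption,
    Metadata,
}

/// Hypothetical node of the reconstructed content tree (an assumption,
/// not the real Infuse type).
#[derive(Debug, Clone, PartialEq)]
pub enum Node {
    /// A run of text extracted from the PDF, tagged with its kind.
    Text { kind: Kind, text: String },
    /// A grouping of child nodes, e.g. a section or a page.
    Group(Vec<Node>),
}

impl Node {
    /// Collect the text of every leaf of the wanted kind, in document order.
    pub fn collect(&self, wanted: Kind) -> Vec<&str> {
        match self {
            Node::Text { kind, text } if *kind == wanted => vec![text.as_str()],
            Node::Text { .. } => Vec::new(),
            Node::Group(children) => children
                .iter()
                .flat_map(|child| child.collect(wanted))
                .collect(),
        }
    }
}
```

With such a tree, calling `collect(Kind::Body)` on the root would give the full body text in reading order, while footnotes, captions and metadata stay available separately under their own kinds.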

wehlutyk/infuse
