To set up, you only need to install:
- Visual Studio 2022
  - With the ".NET Desktop Development" workload
- .NET 9 Preview 4
  - Download version 9.0.1xx-preview4 (with the 9.0-preview4 runtime) from https://github.com/dotnet/installer?tab=readme-ov-file#table
Inside Visual Studio, go to Tools -> Options -> Environment -> Preview Features and turn on "Use previews of the .NET SDK".
You also need to download the two models and extract them to the correct folders.
Both models go in a folder called "onnx-models" that you need to create inside the project folder (PDFAnalyzer).
The models can be downloaded from the following links:
- https://huggingface.co/optimum/all-MiniLM-L6-v2
- https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-onnx
Clone the repositories and copy the model files into the folders described below.
Phi-3-mini-4k-instruct-onnx has 3 different versions inside its repo; this project uses the DirectML version. Create a folder called "phi3-directml-int4-awq-block-128" inside the "onnx-models" folder and copy the contents of the repo's "directml/directml-int4-awq-block-128" folder into this new folder.
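For reference, here is a minimal PowerShell sketch of these download steps. It assumes git and Git LFS are installed and that you run it from the PDFAnalyzer project folder; you can just as well download the same files manually from the Hugging Face pages linked above.

```powershell
# One-time Git LFS setup so the large .onnx files are actually downloaded.
git lfs install

# Embedding model: clone straight into onnx-models\all-MiniLM-L6-v2.
New-Item -ItemType Directory -Path onnx-models -Force
git clone https://huggingface.co/optimum/all-MiniLM-L6-v2 onnx-models/all-MiniLM-L6-v2

# Phi-3: clone to a temporary location (this fetches every variant, which is a
# large download), then copy only the DirectML int4 files into onnx-models.
git clone https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-onnx ..\Phi-3-mini-4k-instruct-onnx
New-Item -ItemType Directory -Path onnx-models\phi3-directml-int4-awq-block-128 -Force
Copy-Item ..\Phi-3-mini-4k-instruct-onnx\directml\directml-int4-awq-block-128\* onnx-models\phi3-directml-int4-awq-block-128\
```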
You don't need to modify PDFAnalyzer.csproj, as it already copies all files in the onnx-models folder to the output directory.
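For reference only, a wildcard include of this kind is commonly written along the following lines in an SDK-style project; the exact entry in PDFAnalyzer.csproj may differ.

```xml
<!-- Illustrative sketch only; check PDFAnalyzer.csproj for the real entry. -->
<ItemGroup>
  <!-- Copy everything under onnx-models next to the built binaries. -->
  <None Update="onnx-models\**\*" CopyToOutputDirectory="PreserveNewest" />
</ItemGroup>
```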
The final folder structure should look like this:
PDFAnalyzer
└── onnx-models
    ├── all-MiniLM-L6-v2
    │   ├── model.onnx
    │   └── vocab.txt
    └── phi3-directml-int4-awq-block-128
        ├── added_tokens.json
        ├── genai_config.json
        ├── model.onnx
        ├── model.onnx.data
        ├── special_tokens_map.json
        ├── tokenizer_config.json
        ├── tokenizer.json
        └── tokenizer.model