Adjusting the dataset #4

Open
frankvp11 opened this issue May 1, 2023 · 0 comments
frankvp11 commented May 1, 2023

Hello!
I am really happy with the project that you have put forward. However, there is a problem with the dataset that I could use your help fixing.
The dataset includes many letters and digits that look alike. For example, a handwritten b is often misinterpreted as a 6, even by humans. There are several such characters that I would like to take out of the dataset. I tried to do this by running !rm -rf {directory} on the images that I didn't want to include, but the model still predicts those classes anyway. How can I get around this?

Edit: I'm sure it has something to do with editing the CSV, and I'm currently trying to figure this out via your dataprocessing.ipynb file, but I'm struggling.
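
To make the CSV idea concrete, here is a rough sketch of the kind of filtering I have in mind. It is only an illustration under assumptions: the column names `image` and `label`, the excluded labels, and the helper names `drop_labels` and `filter_csv` are all hypothetical and not taken from the repository, so the real CSV layout may differ.

```python
import csv
import io

# Hypothetical column name and labels; the project's actual CSV may differ.
LABEL_COLUMN = "label"
EXCLUDED_LABELS = {"b", "6"}


def drop_labels(rows, label_column=LABEL_COLUMN, excluded=EXCLUDED_LABELS):
    """Return only the rows whose label is not in the excluded set."""
    return [row for row in rows if row[label_column] not in excluded]


def filter_csv(src, dst, label_column=LABEL_COLUMN, excluded=EXCLUDED_LABELS):
    """Copy CSV rows from src to dst, skipping rows with excluded labels.

    src and dst are open text file objects. Returns the number of rows kept.
    """
    reader = csv.DictReader(src)
    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
    writer.writeheader()
    kept = drop_labels(reader, label_column, excluded)
    writer.writerows(kept)
    return len(kept)


if __name__ == "__main__":
    # Tiny in-memory example standing in for the real dataset CSV.
    sample = io.StringIO("image,label\na.png,a\nb.png,b\nsix.png,6\nc.png,c\n")
    out = io.StringIO()
    print(filter_csv(sample, out), "rows kept")
    print(out.getvalue())
```

My guess is that filtering the CSV alone may not be enough: if the model's output layer was trained with the removed classes, it would presumably still be able to predict them until it is retrained on the reduced label set.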
