main.deepforest.predict_file should be able to take in a dataframe, not just a csv file. #797

bw4sz · 2024-09-30T23:51:19Z

Looking at the code base.

https://deepforest.readthedocs.io/en/latest/_modules/deepforest/main.html#deepforest.predict_file

it actually looks like dataset.TreeDataset() is the problem here, because utilities.read_file is definitely flexible enough. The docstring of predict_file should be updated and the argument name should be changed (but a deprecation warning added until 2.0), but its the dataset class, which the user really never sees that often that can be updated. I think this would still be considered a patch.

Abhishek-kumar0503 · 2024-11-04T09:03:02Z

After read your statement it seems that utilities.read_file is reads the input CSV file and is flexible and the dataset.TreeDataset is takes the data from the CSV file and organizes it for prediction and here the issue: TreeDataset isn't flexible enough to handle different formats or structures of input data.
My solution is to Update TreeDataset to be more flexible, so i make the class of name is TreeDataset:

class TreeDataset:
      def __init__(self, csv_file, root_dir, transforms=None, train=False, column_mapping=None):
          self.root_dir = root_dir
          self.transforms = transforms
          self.train = train
          self.column_mapping = column_mapping or {
              "image_path": "image_path",
              "xmin": "xmin",
              "ymin": "ymin",
              "xmax": "xmax",
              "ymax": "ymax"
          }
       
          # Load the CSV file with flexible handling
          self.annotations = self._load_annotations(csv_file)

      def _load_annotations(self, csv_file):
          # Read the CSV file using the existing utility
          df = utilities.read_file(csv_file)
  
          # Rename columns based on column_mapping, if necessary
          df = df.rename(columns=self.column_mapping)
  
          return df

and update the predict_file
ds = dataset.TreeDataset(csv_file=csv_file, root_dir=root_dir, transforms=None, train=False)

Is this correct?

henrykironde · 2024-11-04T11:36:00Z

@Abhishek-kumar0503, It looks like you're on the right track. Could you submit a PR?.

Abhishek-kumar0503 · 2024-11-04T17:25:26Z

ya sure @henrykironde . i set up this project in my laptop. I have one query that if i change or update some function in main.py file where it will display that changes.

bw4sz added good first issue Good for newcomers API This tag is used for small improvements to the readability and usability of the python API. labels Sep 30, 2024

henrykironde closed this as completed Nov 4, 2024

henrykironde reopened this Nov 4, 2024

Abhishek-kumar0503 mentioned this issue Nov 16, 2024

fix issue #797 predict_file should be able to take in a dataframe #838

Open

naxatra2 linked a pull request Dec 12, 2024 that will close this issue

updated main.deepforest.predict_file for DF usage #852

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

main.deepforest.predict_file should be able to take in a dataframe, not just a csv file. #797

main.deepforest.predict_file should be able to take in a dataframe, not just a csv file. #797

bw4sz commented Sep 30, 2024

Abhishek-kumar0503 commented Nov 4, 2024 •

edited

Loading

henrykironde commented Nov 4, 2024

Abhishek-kumar0503 commented Nov 4, 2024 •

edited

Loading

main.deepforest.predict_file should be able to take in a dataframe, not just a csv file. #797

main.deepforest.predict_file should be able to take in a dataframe, not just a csv file. #797

Comments

bw4sz commented Sep 30, 2024

Abhishek-kumar0503 commented Nov 4, 2024 • edited Loading

henrykironde commented Nov 4, 2024

Abhishek-kumar0503 commented Nov 4, 2024 • edited Loading

Abhishek-kumar0503 commented Nov 4, 2024 •

edited

Loading

Abhishek-kumar0503 commented Nov 4, 2024 •

edited

Loading