
Directory Error #11

Open
Spruces opened this issue Oct 9, 2020 · 7 comments
Comments

@Spruces

Spruces commented Oct 9, 2020

Hello, I tried to extract the features with extract_features.py

[screenshot of the error]

and this is the result.

I can't find where open_data_type_78_header_valid.csv is supposed to be located.

  • Can I just use this model as easily as the Sato web demo,
    just upload a file and get the prediction back?

It's too complicated to use, and there aren't many instructions.

@jason022085

I get the same error.
I think the solution is to understand how VizNet is used, because it contains similar code.
https://github.com/mitmedialab/viznet

@Spruces
Author

Spruces commented Mar 17, 2021

> Same error occurs to me.
> I think the solution is to understand the usage of Viznet, because there is similar code in it.
> https://github.com/mitmedialab/viznet

Thanks bro, it's weird.
To execute it, you need certain .csv files (like open_data_type_78_header_valid.csv) that are not provided by Sato.

I examined the code: you need certain NLTK resources to generate open_data_type_78_header_valid.csv.

Also, after I found the right NLTK resources, I couldn't figure out exactly which directory they should be placed in.

Poor guidelines.

@Spruces
Author

Spruces commented Mar 17, 2021

And how can they detect semantic columns with NLTK?

Take my ID, and yours.
They don't carry any semantic meaning; they are just nouns.

I think that to detect 'ID' columns based on our IDs (Spruces, jason022085), they need something else, like the length of the inputs, how many blanks they contain, or the number of comma separators (for addresses).

Because there's no meaning in our names or addresses by themselves.

@jason022085

jason022085 commented Mar 19, 2021

P.S. Remember to download the VizNet data first (produced by retrieve_corpora.sh) and set RAW_DIR to the path of the raw data from VizNet.
In my case, it is os.environ['RAW_DIR'] = r"D:\viznet-master\raw"
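A minimal sketch of that setup, assuming the extraction scripts read RAW_DIR from the environment (the Windows path below is just this thread's example; substitute your own):

```python
import os

# Point RAW_DIR at the directory holding the raw VizNet tables
# (example path from this thread; adjust to wherever
# retrieve_corpora.sh placed the data on your machine).
os.environ['RAW_DIR'] = r"D:\viznet-master\raw"

# Sanity check before kicking off the extraction scripts.
raw_dir = os.environ.get('RAW_DIR')
assert raw_dir, "RAW_DIR must be set before running the extraction scripts"
```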

Since the other datasets are too large, I am working with the "manyeyes" dataset.
[screenshot]

I also found that there is an extract_header.py; it may help.
[screenshot]

I got it! You have to extract the headers first, and then extract the features.
[screenshot]

@RichaMax

RichaMax commented Mar 24, 2021

Hello, from what I understand you can download open_data_type_78_header_valid.csv and the Sherlock features they used with their script ./download_data.sh

@horseno
Collaborator

horseno commented Mar 24, 2021

Sorry about the confusion. If you wish to extract the features, you'll first need to run extract_header.py to get the valid headers (in the 78 types after canonicalization) from the dataset.
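The resulting order of operations can be sketched as follows. The command-line form ("python <script> <corpus>") is an assumption, since the thread only establishes which script runs first; check the repository's scripts for the actual interface:

```python
# Assumed two-step pipeline for one VizNet corpus: headers first,
# then features. The argument form below is illustrative only.
corpus = "manyeyes"  # example corpus used earlier in this thread

steps = [
    # extract_header.py produces the valid-header .csv files
    # (e.g. open_data_type_78_header_valid.csv) ...
    ["python", "extract_header.py", corpus],
    # ... which extract_features.py then reads.
    ["python", "extract_features.py", corpus],
]

for cmd in steps:
    print(" ".join(cmd))
```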

@iganand

iganand commented Sep 9, 2021

I get empty header files when I run extract_header.py for manyeyes. Can you please help? What might be causing this? Please let me know.
