The electronic MoCA dataset description for users. Adapted from Thai MoCA test
The dataset consists of two main parts:
- Demographic
demographic data related to patients or subject participating in cognitive assessment with e-MoCA. The pariticipant information is encoded with numerical code and hash number identifiers.
- Cognitive Assessment
Cognive assessment data from each test composing the e-MoCA tools. The assessments are divided into 7 categories by cognitive function in the same manner of original MoCA test which are 1. Visual + Spatial 2. Naming 3. Memory 4. Attention 5. Language 6. Abstraction 7. Orientation
In this dataset, the demographic data is stored in 3 files
-
summary.txt containing following columns
No. Column name Description 1 patient_id Identifier of subject 2 citizen_id Masked citizen id 3 staff_id Identifier for a practitioner who administrate the assessment to the subject 4 education Numerical code for education as described in edu.txt
5 test_date Datetime when the test is being administrated 6 test_province Province in which subject participate in assessment 7 test_location The assessment sites as described in location.txt
other columns is not relate to the cognitive assessment and can be dropped
-
edu.txt file containing map between numeric code and the actual value of education
-
location.txt Assessment site
The dataset is divided into 7 categories in the same way as original MoCA test design. however, our tools impliment Thai language version
-
The first category is Visuo/spatial domain
- Containing all
visuospatial_p{number}.txt
files. The other files dubbedvisuospatial_p{number}t.txt
is time series data that we can neglect it for now. - The important data for each table in a file is following
No Colume name Description 1 patient_id Identifier of subject 2 score Evaluated scaled score by the psychologist administrated the assessment (only use value which seq_no = 1
)3 create_date The datetime that the data is recorded 4 seq_no The number specified the order of the evaluation where 0
indicate automated data collection by eMoCA and1
indicate evaluation process by psychologist - Containing all
-
The second one is Naming domain
- The only file that should be used is
naming_score.txt
with the following description
No Colume name Description 1 patient_id Identifier of subject 2 score Evaluated scaled score by the psychologist 3 create_date The datetime that the data is recorded 4 animal{#num} The participant answer to the corresponding question 5 animal{#num}_check Psychologists evaluation for that question - The only file that should be used is
-
The third one is Memory/Delayed Recall domain
- This domain contain two tasks, Meomorization and Recall. However, only Recall is being evaluated as scaled score in original MoCA test so, we can use only
delayed_recall.txt
file. The important columns are listed here
No Colume name Description 1 patient_id Identifier of subject 2 score Evaluated domain's scaled score by the psychologist 3 create_date The datetime that the data is recorded - This domain contain two tasks, Meomorization and Recall. However, only Recall is being evaluated as scaled score in original MoCA test so, we can use only
-
The fourth one is Attention domain
- This domain has 4 questions. Each question assessment is store in
att_p{num}.txt
. The most important columns in each files are the same.
No Colume name Description 1 patient_id Identifier of subject 2 score Evaluated question's scaled score by the psychologist 3 create_date The datetime that the data is recorded However, there are some additional columns in
att_p1.txt
,att_p2.txt
andatt_p4.txt
which are answer{num} that store the answer from participant during the test. And foratt_p3.txt
there is columnstimecount
storing the total number of times participant knock the table (responding to task assignment). - This domain has 4 questions. Each question assessment is store in
-
The fifth one is Language domain
- This domain is quite overlooked in terms of language assessment, so we have only two question stored in files
lang_p{num}.txt
. The important columns are listed here
No Colume name Description 1 patient_id Identifier of subject 2 score Evaluated question's scaled score by the psychologist 3 create_date The datetime that the data is recorded Additional column in part one is in part two where the total number of ก.ไก่ word maybe be of use for analysis in column
wordcount
- This domain is quite overlooked in terms of language assessment, so we have only two question stored in files
-
The sixth one is Abstraction domain
- This domain is stored in only one
abs.txt
file.
No Colume name Description 1 patient_id Identifier of subject 2 score Evaluated domain's scaled score by the psychologist 3 answer{num} Evaluate score by psychologist for each question 4 create_date The datetime that the data is recorded - This domain is stored in only one
-
The last one is Orientation domain
- This domain is stored in only one
ori.txt
file.
No Colume name Description 1 patient_id Identifier of subject 2 score Evaluated domain's scaled score by the psychologist 3 answer{num} Evaluate score by psychologist for each question 4 create_date The datetime that the data is recorded - This domain is stored in only one