montesmariana · Sib007 · May 2, 2023 · May 2, 2023 · May 2, 2023 · May 8, 2023
diff --git a/README.md b/README.md
@@ -1,43 +1,25 @@
-# A script to make you proud
-
-This repository contains a small Python program that shows that I have learned Python in this semester.
-
-The code has been developed by Mariana Montes.
-
-## Installation and usage
-
-Clone this repository with the following git command in the console (or download the ZIP file via GitHub):
-
-```sh
-git clone [email protected]:montesmariana/intro_machine_learning_using_python
-```
-
-You can import the script as a module by adding the repository to your path in a Python script or interactive notebook and then calling `import`.
-
-```python
-import sys
-sys.path.append('/path/to/intro_machine_learning_using_python')
-import script as s
-```
-
-Check out `tutorial.md` to see an example of the different functionalities of the script!
-
-You can also run the script directly with the following code in a console:
-
-```sh
-python script.py <example.json>
-```
-
-Or in Jupyter notebook with:
-
-```python
-%run script.py <example.json>
-```
-
-In both cases `example.json` stands for the `filename` argument that the script needs. You can use [the file in this repository](example.json) or a similar file of yours. Find more information on how this script works with:
-
-```sh
-python script.py --help
-```
-
-If you run this script, you become proud of yourself.
+# What I'm planning
+
+- Start from the translation job management script created for the first assignment, but expand on it to make it useful for a translation agency rather than an independent translator.
+- Three class attributes (strings) :
+    - Translator
+    - Revisor
+    - Status
+
+   -> The default value for "Translator" and "Revisor" is "Internal", meaning that an employee of the translation agency took up the job. If the agency assigned the job to a freelancer, the default value can be changed to their name.
+
+   -> The default value for "Status" is "Created". The status can then be updated as the project progresses to "In translation", "In revision", "Delivered", "Delayed" or "Cancelled". If possible, the script should only accept these six labels to prevent organisational chaos due to everyone using their own labels.
+- Instance and computed attributes remain the same as in the first assignment (with some edits to add the advice from the first assignment's feedback).
+- Add validation for unexpected input (+ for "Status" labels different from the six authorised labels?)
+- Add methods (?) to call the computed attributes and get a result that's more legible than what this currently generates (for example "22 days" instead of "datetime.timedelta(days=22)")
+- The input will still be read from a list of dictionaries in a separate json-file. Those dictionaries will be described in a separate markdown file.
+
+# What I'd like to add but don't know how
+
+- It would be neat if I could do something with the translation memory and termbase of each project. Maybe add a method that opens them for a preview?
+- It would also be super useful to have a way to align a source and a target text and generate a file that can be added to a translation memory. So, to start from two docx-files (or txt-files), split them into sentences and pair each sentence in the source text with the corresponding sentence in the target text and generate a single xml-file with the paired sentences. Ideally, the context (i.e. the surrounding sentences) should also be considered, but if that's not possible an aligned xml would already be awesome.
+
+# Things I'm not yet sure how to integrate into the assignment
+
+- Use of argparse.
+- Use of regular expressions.
diff --git a/Translation-technology_TBexample.csv b/Translation-technology_TBexample.csv
@@ -0,0 +1,24 @@
+Entry_ID,Entry_Subject,Entry_Domain,Entry_ClientID,Entry_ProjectID,Entry_Created,Entry_Creator,English,French
+0,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:40:45 PM,Sibylle,machine translation,traduction automatique
+1,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:43:07 PM,Sibylle,advanced topics,sujets avancés
+2,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:43:42 PM,Sibylle,translator,traducteur
+3,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:43:58 PM,Sibylle,localiser,localisateur
+4,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:44:10 PM,Sibylle,revisor,réviseur
+5,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:44:27 PM,Sibylle,website,site web
+6,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:44:43 PM,Sibylle,software,software
+7,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:44:53 PM,Sibylle,game,jeu
+8,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:45:02 PM,Sibylle,subtitler,sous-titreur
+9,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:45:12 PM,Sibylle,post-editor,post-éditeur
+10,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:45:40 PM,Sibylle,technical writer,rédacteur technique
+11,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:46:02 PM,Sibylle,computational linguist,linguiste informatique
+12,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:46:15 PM,Sibylle,translation technology,technologies de la traduction
+13,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:46:34 PM,Sibylle,work placement,stage
+14,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:48:18 PM,Sibylle,data,données
+15,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:50:06 PM,Sibylle,statistical,statistique
+16,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:50:31 PM,Sibylle,neural,neuronal
+17,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:50:41 PM,Sibylle,hybrid,hybride
+18,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:50:51 PM,Sibylle,adaptive,adaptatif
+19,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:53:07 PM,Sibylle,MT,traduction automatique
+20,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:53:31 PM,Sibylle,MT engine,engin de traduction automatique
+21,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:54:20 PM,Sibylle,post-editing,post-édition
+22,Postgraduate programme translation technology,Translation technology,KU Leuven,Intro to Python,5/8/2023 3:55:51 PM,Sibylle,pre-editing,pré-édition
diff --git a/Translation-technology_TBexample.xdl b/Translation-technology_TBexample.xdl