Strange linebreaks in the speech #50
Comments
When reporting an error (in general, but especially from SWEDEB), we should include the ID attribute and/or line number of the XML element in question. That will make it easier to locate the problematic example.
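As a rough illustration of the kind of lookup this would enable, here is a minimal sketch that finds an element by its `xml:id` and reports the line it starts on, using only the Python standard library. The function name `find_line_by_id` and the sample document are made up for illustration; the IDs and element layout in the actual corpus may differ.

```python
import xml.parsers.expat


def find_line_by_id(xml_text, target_id):
    """Return (element_name, line_number) for the first element whose
    xml:id equals target_id, or None if no element matches.

    Hypothetical helper: assumes the ID is stored in an `xml:id` attribute.
    """
    # Without namespace processing, expat reports attribute names as written,
    # so the attribute key is literally "xml:id".
    parser = xml.parsers.expat.ParserCreate()
    matches = []

    def on_start(name, attrs):
        if attrs.get("xml:id") == target_id:
            # CurrentLineNumber is the 1-based line where this start tag begins.
            matches.append((name, parser.CurrentLineNumber))

    parser.StartElementHandler = on_start
    parser.Parse(xml_text, True)
    return matches[0] if matches else None


# Invented sample, not taken from the corpus.
sample = (
    "<text>\n"
    '  <u xml:id="a1">\n'
    '    <seg xml:id="a2">Hello</seg>\n'
    "  </u>\n"
    "</text>"
)
print(find_line_by_id(sample, "a2"))
```

With both the ID and the line number in a report, the element can be found either by searching for the ID or by jumping straight to the line.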
I agree! Ping @fredrik1984
Do we know how SWEDEB handles line breaks? It's possibly because of this: It's no problem in terms of TEI, but it also makes the text difficult to render nicely. I think @ninpnin has talked about how to join these in a reasonable way. (Right?)
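One simple way to join such lines for display, sketched here under the assumption that the line breaks inside a segment carry no meaning, is to collapse every run of whitespace into a single space. This is not necessarily the approach discussed elsewhere in the project, and `join_lines` is a hypothetical name.

```python
def join_lines(text):
    """Collapse all whitespace runs, including newlines and indentation,
    into single spaces and strip leading/trailing whitespace."""
    # str.split() with no arguments splits on any whitespace run
    # and drops empty strings, so joining with " " normalizes the text.
    return " ".join(text.split())


# Invented sample text, not taken from the speech in question.
raw = "Mr Speaker!\n        I would like\n        to say a few words.\n"
print(join_lines(raw))
```

Note that this does not rejoin words hyphenated across line breaks; that would need separate handling.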
Yes, I think this would be the reason. I.e., it's an example of a segmentation error.
Yes, once we in the Swerik project have implemented an ID for each speech, SweDeb should use it as well. I put this issue among the other segmentation-oriented issues in the backlog.
I was looking at this speech. It showed really strange line breaks in the middle of the speech when viewed in the SweDeb interface.
{"id":"prot-1909--ak--024_026","gender":"Man","party":"S","year":1909,"speaker":"Karl Starbäck"}
Start of speech:
We should probably try to fix similar issues in multiple documents.