Skip to content

collaite/datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Datasets

These datasets were created for the COLLaiTE project and can be used to experiment with the Collens visualization tool. For detailed instructions about Collens and how to upload data, see the Collens documentation here.

Aladdin

Background

Raymond Brulez (1895-1972) wrote the short story collection Sheherazade of Literatuur als Losprijs between 1929-1931; it was published in 1932. Inspired by the famous Arabian Nights storyteller, Brulez plays with the theme of an author who struggles to write. His literary archive contains many notes, draft manuscripts, and corrected typescripts that together represent beautifully his own authoring process and struggles as a writer. For the COLLaiTE project, we used two draft versions of the story Aladdin: the quarto typescript and the folio typescript. Since both versions contain revisions in Brulez' handwriting, we can distinguish several writing stages.

The transcriptions of Aladdin are created by the Centre for Manuscript Genetics of the University of Antwerp.

TEI XML encoding

The text of each version, including the authorial revisions, is trancribed in TEI-compliant XML, following the encoding manual of the Beckett Digital Manuscript Project. Based on the <del> and <add> elements, we gerenated two plain text files from each XML file: the first version, which contains all the original text before Brulez' made any revisions, and the version after Brulez revised it (the "final" version). By comparing these two "intermediate" versions with one another, we gain detailed insight into the writing process of Brulez. The two versions are visualized in the Collens application.

License

This dataset is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.

Copyright holders: University of Antwerp, Centre for Manuscript Genetics; Huygens Institute.

You are free to:

  • Share — copy and redistribute the material in any medium or format
  • Adapt — remix, transform, and build upon the material

Under the following terms:

  • Attribution — You must give appropriate credit to the University of Antwerp, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
  • NonCommercial — You may not use the material for commercial purposes.
  • No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.

Citation

When using the Brulez XML files, please cite:

[Brulez, Raymond, "Aladdin", 1929-1930. Encoded by Elli Bleeker, Vincent Neyt, and Dirk Van Hulle. 2017, Centre for Manuscript Genetics, University of Antwerp.]

For more details about this license, visit: https://creativecommons.org/licenses/by-nc/4.0/.

About

Datasets used in the CollAIte project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors