Software Engineer with a background in Machine Learning and Security. I have previously built
- a SaaS and MLOps platform for highly efficient and multilingual Text Classification
- an infrastructure automation framework
- LLM harnesses and apps
- large language corpora
- OSS tools and learning materials
Also, I
- contributed to computational methods in the field of Digital Humanities
- worked on XML terminology representation standards with ISO/DIN
- have a lot of unpublished data on metaphor use in historical German novels (unfinished PhD)
Publications
- Jannidis, F., Reimers, N., Pernes, S., Pielström, S., Vitt, T. (2016). DARIAH-DKPro-Wrapper Output Format (DOF) Specification. DARIAH-DE Working Paper. urn:nbn:de:gbv:7-dariah-2016-6-2
- Pernes, S., Pielström, S., Bock, S., Du, K., Huber, M. (2016). Der Einsatz quantitativer Textanalyse in den Geisteswissenschaften: Bericht über den Stand der Forschung. DARIAH-DE Working Paper. urn:nbn:de:gbv:7-dariah-2016-4-0
- Aurast, A., Gradl, T., Pernes, S., and Pielström, S. (2016). Big Data und Smart Data in den Geisteswissenschaften. In: Bibliothek - Forschung und Praxis, Band 40, Heft 2: pp. 200-206, doi:10.1515/bfp-2016-0033
- Schumacher, M., Held, M., Falk, C., and Pernes, S. (2016). Big Data in den Geisteswissenschaften: Konzept für eine Lehr- und Lernmittelsammlung. DARIAH-DE Working Papers Nr. 15. Göttingen: DARIAH-DE. urn:nbn:de:gbv:7-dariah-2016-1-2
- Pernes, S. (2013). Die große Freiheit der kleinen Bücher. In: Schulheft, No. 151, pp. 87-92. Innsbruck: StudienVerlag.
- Bowers, J., Pernes, S., Romary, L. (2017). conceptEntry: A TBX-based expansion of the TEI for the encoding of onomasiological and comparative lexical data. In TEI Members Meeting 2017: Conference Abstracts. University of Victoria, Victoria BC.
- Pernes, S., Romary, L., Warburton, K. (2017). TBX in ODD: Schema-agnostic specification and documentation for TermBase eXchange. In Proceedings of Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017). ACL Anthology W17-7000
- Pernes, S., Keller, L., Peterek, C. (2017). Aufbau eines historisch-literarischen Metaphernkorpus für das Deutsche. In DHd 2017, Digital Humanities im deutschsprachigen Raum: Konferenzabstracts. Universität Bern, Bern, pp. 91-94.
- Pernes, S. (2016). Metaphor Mining in Historical German Novels: Using Unsupervised Learning to Uncover Conceptual Systems in Literature. In Digital Humanities 2016: Conference Abstracts. Jagiellonian University and Pedagogical University, Kraków, pp. 651-653.
- Reimers, N., Jannidis, F., Pernes, S., Pielström, S., Reger, I., Vitt, T. (2016). A Tool for NLP-Preprocessing in Literary Text Analysis. In Digital Humanities 2016: Conference Abstracts. Jagiellonian University and Pedagogical University, Kraków, pp. 871-872.
- Pernes, S. (2015). Metaphor Mining in Historical German Novels: An Unsupervised Learning Approach. In: Proceedings of the IEEE International Conference on Big Data 2015, Santa Clara, pp. 1650-1652. doi:10.1109/BigData.2015.7363934
- Schreger, C. and Pernes, S. (2014). The Big World of ‘Little Books’. In Hélot, C., Sneddon, R. and Daly, N. (eds). Children’s Literature in the Multilingual Classroom. London: IOE Press, pp. 154-171
- Pielström, S., Pernes, S., Reimers, N., Bock, S., Dürholt, P., Du, K. (2015). NLP Based Analysis of Literary Texts (https://dariah-de.github.io/DARIAH-DKPro-Wrapper/tutorial.html).
Talks and Posters
- ownLLM. Technische Universität Wien / CAIML Workshop at 7th International B2B Software Days, 09.05.2023, Wiener Rathaus
- Text Ops: Language Automation and the End of the Web as we have known it. AI Speaker Night, 23.03.2023, Talentgarden Vienna
- conceptEntry: A TBX-based expansion of the TEI for the encoding of onomasiological and comparative lexical data. TEI Members Meeting and Linguistics SIG Meeting, 13./14.11.2017, Victoria BC
- TBX in ODD: Schema-agnostic specification and documentation for TermBase eXchange. Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017), 19.09.2017, Montpellier
- conceptEntry. A TBX-based expansion of the TEI for the encoding of onomasiological and comparative lexical data. ISO/TC 37 Annual Meeting, 29.06.2017, Vienna
- TBX in TEI. A TBX-based expansion of the TEI for the encoding of onomasiological and comparative lexical data. ALMAnaCH Kick-off Meeting, 18.05.2017, Berlin
- Aufbau eines historisch-literarischen Metaphernkorpus für das Deutsche. DHd 2017, Digital Humanities im deutschsprachigen Raum, 15.02.2017, Bern
- Aufbau eines historisch-literarischen Metaphernkorpus für das Deutsche. Stuttgart Research Center for Text Studies, 16.11.2016, Stuttgart
- Metaphor Mining in Historical German Novels: Using Unsupervised Learning to Uncover Conceptual Systems in Literature. Digital Humanities 2016, 14.07.2016, Kraków
- A Tool for NLP-Preprocessing in Literary Text Analysis. Digital Humanities 2016, 11.-16.07.2016, Kraków (Poster)
- A Tool for NLP-Preprocessing in Literary Text Analysis. DHd 2016, Digital Humanities im deutschsprachigen Raum, 07.-12.03.2016, Leipzig (Poster)
- A Tool for NLP-Preprocessing in Literary Text Analysis. DARIAH-DE Grand Tour, 18.-19.02.2016, Göttingen (Poster)
- Readability Measures. Workshop ‘Complexity Measures in Stylometry’. DARIAH-DE Expertenworkshop, 07.12.2015, Würzburg
- Metaphor Mining in Historical German Novels: An Unsupervised Learning Approach. 3rd Workshop on Big Humanities Data, IEEE International Conference on Big Data 2015, 29.10.2015, Santa Clara
- Natural Language Processing zur Analyse literarischer Texte. Workshop Natural Language Processing für Literaturwissenschafter. DARIAH-DE Methodenworkshop, 16.09.2015, Würzburg
- Introduction of DARIAH-EU Working Group ‘Text and Data Analytics’. DARIAH-EU 5th General VCC meeting, 22.04.2015, Ljubljana
- Big, complex, heterogeneous.. Laufende Projekte aus dem Arbeitsbereich ‘Big Data in den Geisteswissenschaften’ in DARIAH-DE. Digital Humanities Summit 2015, 03.-04.03.2015, Berlin (Poster)
- Big, complex, heterogeneous.. Laufende Projekte aus dem Arbeitsbereich ‘Big Data in den Geisteswissenschaften’ in DARIAH-DE. DHd 2015, Digital Humanities im deutschsprachigen Raum, 23.-27.02.2015, Graz (Poster)