Skip to content
View neospe's full-sized avatar

Block or report neospe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
neospe/README.md

Hi, I'm Stefan 👋

Software Engineer with a background in Machine Learning and Security. I have previously built

  • a SaaS and MLOps platform for highly efficient and multilingual Text Classification
  • an infrastructure automation framework
  • LLM harnesses and apps
  • large language corpora
  • OSS tools and learning materials

Also, I

  • contributed to computational methods in the field of Digital Humanities
  • worked on XML terminology representation standards with ISO/DIN
  • have a lot of unpublished data on metaphor use in historical German novels (unfinished PhD)
Publications

Journal Articles

  • Jannidis, F., Reimers, N., Pernes, S., Pielström, S., Vitt, T. (2016). DARIAH-DKPro-Wrapper Output Format (DOF) Specification. DARIAH-DE Working Paper. urn:nbn:de:gbv:7-dariah-2016-6-2
  • Pernes, S., Pielström, S., Bock, S., Du K., Huber, M. (2016). Der Einsatz quantitativer Textanalyse in den Geisteswissenschaften: Bericht über den Stand der Forschung. DARIAH-DE Working Paper. urn:nbn:de:gbv:7-dariah-2016-4-0
  • Aurast, A., Gradl, T., Pernes, S., and Pielström, S. (2016). Big Data und Smart Data in den Geisteswissenschaften. In: Bibliothek - Forschung und Praxis, Band 40, Heft 2: pp. 200-206, doi:10.1515/bfp-2016-0033
  • Schumacher, M., Held, M., Falk, C., and Pernes, S. (2016). Big Data in den Geisteswissenschaften: Konzept für eine Lehr- und Lernmittelsammlung. DARIAH-DE Working Papers Nr. 15. Göttingen: DARIAH-DE. urn:nbn:de:gbv:7-dariah-2016-1-2
  • Pernes, S. (2013). Die große Freiheit der kleinen Bücher. In: Schulheft, No.151. pp. 87-92. Innsbruck: StudienVerlag.

Conference Papers

  • Bowers, J., Pernes, S., Romary, L. (2017). conceptEntry: A TBX-based expansion of the TEI for the encoding of onomasiological and comparative lexical data. In TEI Members Meeting 2017: Conference Abstracts. University of Victoria, Victoria BC.
  • Pernes, S., Romary, L., Warburton, K. (2017). TBX in ODD: Schema-agnostic specification and documentation for TermBase eXchange. In Proceedings of Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017). ACL Anthology W17-7000
  • Pernes, S., Keller, L., Peterek, C. (2017). Aufbau eines historisch-literarischen Metaphernkorpus für das Deutsche. In DHd 2017, Digital Humanities im deutschsprachigen Raum: Konferenzabstracts. Universität Bern, Bern, pp. 91-94.
  • Pernes, S. (2016). Metaphor Mining in Historical German Novels: Using Unsupervised Learning to Uncover Conceptual Systems in Literature. In Digital Humanities 2016: Conference Abstracts. Jagiellonian University and Pedagogical University, Kraków, pp. 651-653.
  • Reimers, N., Jannidis, F., Pernes, S., Pielström, S., Reger, I., Vitt, T. (2016). A Tool or NLP-Preprocessing in Literary Text Analysis. In Digital Humanities 2016: Conference Abstracts. Jagiellonian University and Pedagogical University, Kraków, pp. 871-872.
  • Pernes, S. (2015). Metaphor Mining in Historical German Novels: An Unsupervised Learning Approach. In: Proceedings of the IEEE International Conference on Big Data 2015, Santa Clara, pp.1650-1652. doi:10.1109/BigData.2015.7363934

Book Chapters

  • Schreger, C. and Pernes, S. (2014). The Big World of ‚Little Books’. In Hélot, C., Sneddon, R. and Daly, N. (eds). Children’s Literature in the Multilingual Classroom. London: IOE Press, pp. 154-171

Teaching Materials

Talks

  • ownLLM. Technische Universität Wien / CAIML Workshop at 7th International B2B Software Days, 09.05.2023, Wiener Rathaus
  • Text Ops: Language Automation and the End of the Web as we have known it. AI Speaker Night, 23.03.2023, Talentgarden Vienna
  • conceptEntry: A TBX-based expansion of the TEI for the encoding of onomasiological and comparative lexical data. TEI Members Meeting and Linguistics SIG Meeting, 13./14.11.2017, Victoria BC
  • TBX in ODD: Schema-agnostic specification and documentation for TermBase eXchange. Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017), 19.09.2017, Montpellier
  • conceptEntry. A TBX-based expansion of the TEI for the encoding of onomasiological and comparative lexical data. ISO/TC 37 Annual Meeting, 29.06.2017, Vienna
  • TBX in TEI. A TBX-based expansion of the TEI for the encoding of onomasiological and comparative lexical data. ALMAnaCH Kick-off Meeting, 18.05.2017, Berlin
  • Aufbau eines historisch-literarischen Metaphernkorpus für das Deutsche. DHd 2017, Digital Humanities im deutschsprachigen Raum, 15.02.2017, Bern
  • Aufbau eines historisch-literarischen Metaphernkorpus für das Deutsche. Stuttgart Research Center for Text Studies, 16.11.2016, Stuttgart
  • Metaphor Mining in Historical German Novels: Using Unsupervised Learning to Uncover Conceptual Systems in Literature. Digital Humanities 2016, 14.07.2016, Kraków
  • A Tool for NLP-Preprocessing in Literary Text Analysis. Digital Humanities 2016, 11.-16.07.2016, Kraków (Poster)
  • A Tool for NLP-Preprocessing in Literary Text Analysis. DHd 2016, Digital Humanities im deutschsprachigen Raum, 07.-12.03.2016, Leipzig (Poster)
  • A Tool for NLP-Preprocessing in Literary Text Analysis. DARIAH-DE Grand Tour, 18.-19.02.2016, Göttingen (Poster)
  • Readability Measures. Workshop ’Complexity Measures in Stylometry’. DARIAH-DE Expertenworkshop, 07.12.2015, Würzburg
  • Metaphor Mining in Historical German Novels: An Unsupervised Learning Approach. 3rd Workshop on Big Humanities Data, IEEE International Conference on Big Data 2015, 29.10.2015, Santa Clara
  • Natural Language Processing zur Analyse literarischer Texte. Workshop Natural Language Processing für Literaturwissenschafter. DARIAH-DE Methodenworkshop, 16.09.2015, Würzburg
  • Introduction of DARIAH-EU Working Group ’Text and Data Analytics’. DARIAH-EU 5th General VCC meeting, 22.04.2015, Ljubljana
  • Big, complex, heterogeneous.. Laufende Projekte aus dem Arbeitsbereich ’Big Data in den Geisteswissenschaften’ in DARIAH-DE. Digital Humanities Summit 2015, 03.-04.03.2015, Berlin (Poster)
  • Big, complex, heterogeneous.. Laufende Projekte aus dem Arbeitsbereich ’Big Data in den Geisteswissenschaften’ in DARIAH-DE. DHd 2015, Digital Humanities im deutschsprachigen Raum, 23.-27.02.2015, Graz (Poster)

Pinned Loading

  1. autofit2 autofit2 Public

    Automated end-to-end data preprocessing, model training, and evaluation pipeline

    Python

  2. autoops autoops Public

    Multi-region data and service mesh - operated by a Makefile.

    Python

  3. dataload dataload Public

    A collection of data set loaders

    Python

  4. tools tools Public

    Digital Philologist's toolbox

    Python

  5. sentence-transformers sentence-transformers Public

    Forked from huggingface/sentence-transformers

    [⚖️ Entropy-based Attention Regularization (EAR) Mod] State-of-the-Art Embeddings, Retrieval, and Reranking

    Python

  6. simplex-chat simplex-chat Public

    Forked from simplex-chat/simplex-chat

    [🐡 OpenBSD port] SimpleX - the first messaging network operating without user identifiers of any kind - 100% private by design! iOS, Android and desktop apps 📱!

    Haskell