Publications


2024

Oedingen, Marc; Engelhardt, Raphael C.; Denz, Robin; Hammer, Maximilian; Konen, Wolfgang

ChatGPT Code Detection: Techniques for Uncovering the Source of Code (Article)

In: arXiv preprint arXiv:2405.15512, 2024.

Keywords: AI, ChatGPT, Code Detection, Large Language Models, machine learning

Oedingen, Marc; Engelhardt, Raphael C.; Denz, Robin; Hammer, Maximilian; Konen, Wolfgang

ChatGPT Code Detection: Techniques for Uncovering the Source of Code (Article)

In: AI, Vol. 5, No. 3, pp. 1066–1094, 2024, ISSN: 2673-2688.

Keywords: AI, ChatGPT, Code Detection, Large Language Models, machine learning

@article{Oedingen2024a,

title = {ChatGPT Code Detection: Techniques for Uncovering the Source of Code},

author = {Marc Oedingen and Raphael C. Engelhardt and Robin Denz and Maximilian Hammer and Wolfgang Konen},

url = {https://www.mdpi.com/2673-2688/5/3/53},

doi = {10.3390/ai5030053},

issn = {2673-2688},

year  = {2024},

date = {2024-01-01},

urldate = {2024-01-01},

journal = {AI},

volume = {5},

number = {3},

pages = {1066--1094},

abstract = {In recent times, large language models (LLMs) have made significant strides in generating computer code, blurring the lines between code created by humans and code produced by artificial intelligence (AI). As these technologies evolve rapidly, it is crucial to explore how they influence code generation, especially given the risk of misuse in areas such as higher education. The present paper explores this issue by using advanced classification techniques to differentiate between code written by humans and code generated by ChatGPT, a type of LLM. We employ a new approach that combines powerful embedding features (black-box) with supervised learning algorithms including Deep Neural Networks, Random Forests, and Extreme Gradient Boosting to achieve this differentiation with an impressive accuracy of 98%. For the successful combinations, we also examine their model calibration, showing that some of the models are extremely well calibrated. Additionally, we present white-box features and an interpretable Bayes classifier to elucidate critical differences between the code sources, enhancing the explainability and transparency of our approach. Both approaches work well, but provide at most 85–88% accuracy. Tests on a small sample of untrained humans suggest that humans do not solve the task much better than random guessing. This study is crucial in understanding and mitigating the potential risks associated with using AI in code generation, particularly in the context of higher education, software development, and competitive programming.},

keywords = {AI, ChatGPT, Code Detection, Large Language Models, machine learning},

pubstate = {published},

tppubtype = {article}

}


2021

Meissner, Simon

Untersuchung des Spiel- und Lernerfolgs künstlicher Intelligenzen für ein nichtdeterministisches Spiel mit imperfekten Informationen: Blackjack in der Game-Learning-Umgebung ’General Board Game’ (GBG) (Thesis)

TH Köln – University of Applied Sciences, 2021, (Bachelor thesis).

Keywords: AI, BT-MT, Game Learning, GBG, machine learning

Zeh, Tim

Untersuchung von allgemeinen KI-Agenten für das Spiel Poker im General Board Games Framework (Thesis)

TH Köln – University of Applied Sciences, 2021, (Master thesis).

Keywords: AI, BT-MT, Game Learning, GBG, machine learning

2020

Bagheri, Samineh

Self-Adjusting Surrogate-Assisted Optimization Techniques for Expensive Constrained Black Box Problems (Doctoral thesis)

Leiden University and TH Köln, 2020, (PhD thesis).

Keywords: BT-MT, machine learning, MONREP, optimization, RBF, SACOBRA, surrogate models

Scheiermann, Johannes

Sind (trainierte) General-Purpose-RL-Agenten im Brettspiel Othello stärker als (untrainierte) General-Game-Playing Agenten? (Research report)

TH Köln, Institut für Informatik, 2020, (practical project).

Keywords: AI, BT-MT, Game Learning, GBG, machine learning, Reinforcement learning

Scheiermann, Johannes

AlphaZero-inspirierte KI-Agenten im General Board Game Playing (Thesis)

TH Köln – University of Applied Sciences, 2020, (Bachelor thesis).

Keywords: AI, BT-MT, Game Learning, GBG, machine learning, Reinforcement learning

2019

Cöln, Julian; Dittmar, Yannick

Untersuchung von KI Agenten im Spiel Othello (Research report)

TH Köln, Institut für Informatik, 2019.

Keywords: AI, BT-MT, Game Learning, GBG, machine learning, Reinforcement learning

Barsnick, Felix

Implementierung und Untersuchung eines Turniersystems für KI-Agenten in Brettspielen (Thesis)

TH Köln – University of Applied Sciences, 2019, (Master thesis).

Keywords: BT-MT, Elo, Game Learning, GBG, Glicko, machine learning, Reinforcement learning

2015

Koch, Patrick; Wagner, Tobias; Emmerich, Michael; Bäck, Thomas; Konen, Wolfgang

Efficient multi-criteria optimization on noisy machine learning problems (Article)

In: Applied Soft Computing, Vol. 29, pp. 357–370, 2015.

Keywords: machine learning, TDMR

2011

Konen, Wolfgang

Der SFA-Algorithmus für Klassifikation (Research report)

Research Center CIOP (Computational Intelligence, Optimization and Data Mining) Cologne University of Applied Science, Faculty of Computer Science and Engineering Science, No. 08/11, 2011, ISSN: 2191-365X.

Keywords: machine learning, SFA, SOMA

2009

Konen, Wolfgang; Bartz-Beielstein, Thomas

Reinforcement learning for games: failures and successes (Proceedings article)

In: GECCO '09: Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference, pp. 2641–2648, ACM, Montreal, Québec, Canada, 2009.

Keywords: games, machine learning, Reinforcement learning

2008

Konen, Wolfgang; Bartz-Beielstein, Thomas

Reinforcement Learning: Insights from Interesting Failures in Parameter Selection (Proceedings article)

In: Rudolph, Günter (Ed.): PPSN'2008: 10th International Conference on Parallel Problem Solving From Nature, Dortmund, pp. 478–487, Springer, Berlin, 2008.

Keywords: learning, machine learning, Reinforcement learning
