Please use this identifier to cite or link to this item: http://dspace.azjhpc.org/xmlui/handle/123456789/194
Title: How ChatGPT Works: Understanding the Architecture and Training Process of ChatGPT
Authors: Hajiyev, Aligulu
Keywords: ChatGPT;OpenAI;probability distribution;self-attention mechanisms
Issue Date: 11-May-2023
Publisher: Azərbaycan Dövlət Neft və Sənaye Universiteti
Abstract: ChatGPT is a conversational AI system developed by OpenAI and built on a transformer architecture with self-attention mechanisms. OpenAI trained the model on a vast quantity of text data from the internet, and the model learnt from that data via unsupervised learning. After this training, the model was fine-tuned on a smaller dataset of conversations to improve its ability to provide coherent and contextually relevant replies. During inference, the model analyses the input text and produces a probability distribution across all potential answers, after which the response with the highest probability is chosen as the output. Overall, ChatGPT is a remarkable achievement of natural language processing and deep learning, with several potential applications in customer assistance, education, content development, and targeted marketing.
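The inference step described in the abstract (scores turned into a probability distribution, then the highest-probability candidate selected) can be illustrated with a toy sketch. This is not taken from the paper: the function names, candidate strings, and scores below are hypothetical, and a softmax is assumed as the way scores become probabilities.

```python
import math

def softmax(scores):
    """Convert raw scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def pick_most_probable(candidates, scores):
    """Return the candidate with the highest probability, plus all probabilities."""
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    return candidates[best], probs

# Hypothetical candidates and scores, for illustration only.
candidates = ["Paris", "London", "Berlin"]
scores = [3.2, 1.1, 0.4]
choice, probs = pick_most_probable(candidates, scores)
print(choice, [round(p, 3) for p in probs])
```

Always taking the maximum is the simplest selection rule and matches the abstract's wording; the sketch makes no claim about how the deployed system selects its output.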
URI: http://dspace.azjhpc.org/xmlui/handle/123456789/194
Journal Title: 1st INTERNATIONAL CONFERENCE ON THE 4th INDUSTRIAL REVOLUTION AND INFORMATION TECHNOLOGY
Volume: 1
Issue: 1
First page number: 65
Last page number: 68
Number of pages: 4
Appears in Collections: 1st INTERNATIONAL CONFERENCE ON THE 4th INDUSTRIAL REVOLUTION AND INFORMATION TECHNOLOGY

Files in This Item:
File: Industy_4-65-68.pdf
Size: 268.2 kB
Format: Adobe PDF

