Please use this identifier to cite or link to this item:
http://dspace.azjhpc.org/xmlui/handle/123456789/194| Title: | How chatgpt Works: Understanding the Architecture and Training Process of chatgpt |
| Authors: | Hajiyev, Aligulu |
| Keywords: | chatgpt;openai;probability distribution;self-attention mechanisms |
| Issue Date: | 11-May-2023 |
| Publisher: | Azərbaycan Dövlət Neft və Sənaye Universiteti |
| Abstract: | Chatgpt is an openai conversational AI system built on a transformer architecture with self-attention methods. Openai used a vast quantity of text data from the internet to train the model, and the machine learnt from the data via unsupervised learning. The model fine-tuned on a smaller dataset of talks following training to increase its ability to provide coherent and contextually relevant replies. During inference, the model analyses the input text and develops a probability distribution across all potential answers, after which the response with the highest probability chosen as the output. Overall, chatgpt is a remarkable achievement of natural language processing and deep learning, with several potential applications in customer assistance, education, content development, and targeted marketing. |
| URI: | http://dspace.azjhpc.org/xmlui/handle/123456789/194 |
| Journal Title: | 1st INTERNATIONAL CONFERENCE ON THE 4th INDUSTRIAL REVOLUTION AND INFORMATION TECHNOLOGY |
| metadata.dc.source.booktitle: | 1st INTERNATIONAL CONFERENCE ON THE 4th INDUSTRIAL REVOLUTION AND INFORMATION TECHNOLOGY |
| Volume: | 1 |
| Issue: | 1 |
| First page number: | 65 |
| Last page number: | 68 |
| Number of pages: | 4 |
| Appears in Collections: | 1st INTERNATIONAL CONFERENCE ON THE 4th INDUSTRIAL REVOLUTION AND INFORMATION TECHNOLOGY |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Industy_4-65-68.pdf | 268.2 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.