OpenAI：GPT-4 技术报告（英文版）.pdfVIP免费

下载本文档

阅读 0
下载 0
格式 pdf
大小 944.03 KB
约98页
2024-06-26
收藏
评论
点赞(0)
海报
举报

/98

GPT-4TechnicalReportOpenAI∗AbstractWereportthedevelopmentofGPT-4,alarge-scale,multimodalmodelwhichcanacceptimageandtextinputsandproducetextoutputs.Whilelesscapablethanhumansinmanyreal-worldscenarios,GPT-4exhibitshuman-levelperformanceonvariousprofessionalandacademicbenchmarks,includingpassingasimulatedbarexamwithascorearoundthetop10%oftesttakers.GPT-4isaTransformer-basedmodelpre-trainedtopredictthenexttokeninadocument.Thepost-trainingalignmentprocessresultsinimprovedperformanceonmeasuresoffactualityandadherencetodesiredbehavior.Acorecomponentofthisprojectwasdevelopinginfrastructureandoptimizationmethodsthatbehavepredictablyacrossawiderangeofscales.ThisallowedustoaccuratelypredictsomeaspectsofGPT-4’sperformancebasedonmodelstrainedwithnomorethan1/1,000ththecomputeofGPT-4.1IntroductionThistechnicalreportpresentsGPT-4,alargemultimodalmodelcapableofprocessingimageandtextinputsandproducingtextoutputs.Suchmodelsareanimportantareaofstudyastheyhavethepotentialtobeusedinawiderangeofapplications,suchasdialoguesystems,textsummarization,andmachinetranslation.Assuch,theyhavebeenthesubjectofsubstantialinterestandprogressinrecentyears[1–28].Oneofthemaingoalsofdevelopingsuchmodelsistoimprovetheirabilitytounderstandandgeneratenaturallanguagetext,particularlyinmorecomplexandnuancedscenarios.Totestitscapabilitiesinsuchscenarios,GPT-4wasevaluatedonavarietyofexamsoriginallydesignedforhumans.Intheseevaluationsitperformsquitewellandoftenoutscoresthevastmajorityofhumantesttakers.Forexample,onasimulatedbarexam,GPT-4achievesascorethatfallsinthetop10%oftesttakers.ThiscontrastswithGPT-3.5,whichscoresinthebottom10%.OnasuiteoftraditionalNLPbenchmarks,GPT-4outperformsbothpreviouslargelanguagemodelsandmoststate-of-the-artsystems(whichoftenhavebenchmark-specifictrainingorhand-engineering).OntheMMLUbenchmark[29,30],anEnglish-languagesuiteofmultiple-choicequestionscovering57subjects,GPT-4notonlyoutperformsexistingmodelsbyaconsiderablemargininEnglish,butalsodemonstratesstrongperformanceinotherlan...

1、当您付费下载文档后，您只拥有了使用权限，并不意味着购买了版权，文档只能用于自身使用，不得用于其他商业用途（如 [转卖]进行直接盈利或[编辑后售卖]进行间接盈利）。
2、本站所有内容均由合作方或网友上传，本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺！文档内容仅供研究参考，付费前请自行鉴别。
3、如文档内容存在违规，或者侵犯商业秘密、侵犯著作权等，请点击“违规举报”。

碎片内容