All Kinds of Attention
Hung-yi Lee (李宏毅)

Prerequisite:
• [Machine Learning 2021] Self-attention (Part 1): https://youtu.be/hYdO9CscNes
• [Machine Learning 2021] Self-attention (Part 2): https://youtu.be/gmsMY5kc-zw

To Learn More…
• Efficient Transformers: A Survey: https://arxiv.org/abs/2009.06732
• Long Range Arena: A Benchmark for Efficient Transformers: https://arxiv.org/abs/2011.04006

How to make self-attention efficient?
The bottleneck is the attention matrix: every query is matched against every key, which costs N×N for a length-N sequence.
Notice:
• Self-attention is only a module in a larger network.
• These speed-up techniques were usually developed for image processing, where the sequences are very long.

Skip Some Calculations with Human Knowledge
Can we fill in some values of the attention matrix with human knowledge instead of computing them?

Local Attention / Truncated Attention
• Only calculate the attention weights between each query and its neighboring keys; set the rest to 0.
• Similar to CNN.

Stride Attention
• Each query attends to keys a fixed stride away, so information from more distant positions can still be gathered.

Global Attention
• Add special tokens into the original sequence. A special token is the "village chief" among the tokens: it attends to every token, and every token attends to it.
• There is no attention between non-special tokens.

Many Different Choices…
• Different heads can use different patterns. Only children make choices; adults take them all.
• Longformer: https://arxiv.org/abs/2004.05150
• Big Bird: https://arxiv.org/abs/2007.14062

Can we only focus on the critical parts?
• Only the large values in the attention matrix matter; entries with small values can be directly set to 0 with little influence on the results.
• How can we quickly estimate which portion of the matrix will have small attention weights?

Clustering
• Reformer: https://openreview.net/forum?id=rkgNKkHtvB
• Routing Transformer: https://arxiv.org/abs/2003.05997
• Step 1: cluster the queries and keys based on similarity (approximate & fast).
• Step 2: if a query and a key belong to the same cluster, calculate their attention weight; if not, set it to 0.

Learnable Patterns
• Sinkhorn Sorting Network: https://arxiv.org/abs/2002.11296 (simplified version here)
• Whether a grid of the attention matrix should be skipped or not is decided by another module, an NN that reads the input sequence and is jointly learned with the rest of the network.

Do we need the full attention matrix?
• Many columns of the attention matrix are redundant.
• Low rank: pick K representative keys (and the corresponding values) out of the N keys, shrinking the attention matrix from N×N to N×K.
• Can we also reduce the number of queries? Doing so would change the output sequence length, so it is usually avoided.

Reduce Number of Keys
• Compressed Attention (https://arxiv.org/abs/1801.10198): compress the keys with convolution (Conv).
• Linformer (https://arxiv.org/abs/2006.04768): representative keys are linear combinations of the N keys.

Review
The attention mechanism is three matrix multiplications with a softmax…
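As a baseline for the review above ("three matrix multiplications with a softmax"), here is a minimal NumPy sketch of vanilla self-attention. It assumes a single head and unbatched inputs; the function name and toy data are illustrative, not from the lecture.

```python
import numpy as np

def full_attention(Q, K, V):
    """Vanilla self-attention: three matrix multiplications plus a softmax.

    Q, K: (N, d); V: (N, d_v). Cost is O(N^2) in sequence length N,
    dominated by the N x N attention matrix softmax(Q K^T / sqrt(d)).
    """
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                    # (N, N) attention scores
    A = np.exp(scores - scores.max(axis=-1, keepdims=True))
    A = A / A.sum(axis=-1, keepdims=True)            # row-wise softmax
    return A @ V                                     # (N, d_v) output

# Toy usage with random data
rng = np.random.default_rng(0)
N, d = 6, 4
Q, K, V = rng.normal(size=(N, d)), rng.normal(size=(N, d)), rng.normal(size=(N, d))
out = full_attention(Q, K, V)
print(out.shape)  # (6, 4)
```

All the efficient variants in these notes modify this computation, mostly by avoiding the full (N, N) score matrix.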
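The local/truncated attention pattern described above (each query only attends to neighboring keys, everything else set to 0) can be sketched with a band mask. This is a simplified single-head NumPy illustration, not the implementation from any particular paper; `window` is an assumed parameter name.

```python
import numpy as np

def local_attention(Q, K, V, window=1):
    """Local / truncated attention: each query attends only to keys within
    `window` positions. All other entries are forced to exactly 0 by
    masking the scores with -inf before the softmax."""
    N, d = Q.shape
    scores = Q @ K.T / np.sqrt(d)
    idx = np.arange(N)
    band = np.abs(idx[:, None] - idx[None, :]) <= window   # width 2*window+1
    scores = np.where(band, scores, -np.inf)
    A = np.exp(scores - scores.max(axis=-1, keepdims=True))
    A = A / A.sum(axis=-1, keepdims=True)
    return A, A @ V

rng = np.random.default_rng(0)
N, d = 6, 4
Q, K, V = rng.normal(size=(N, d)), rng.normal(size=(N, d)), rng.normal(size=(N, d))
A, out = local_attention(Q, K, V, window=1)
print(A[0])  # only the first two entries of row 0 are non-zero
```

A stride pattern is the same idea with a different mask, e.g. `(idx[:, None] - idx[None, :]) % stride == 0`.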
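The two clustering steps above (cluster queries/keys, then attend only within the same cluster) can be sketched as follows. This is a heavily simplified single-head illustration of the Reformer / Routing Transformer idea: the cluster assignments are taken as given, whereas in the actual papers they come from a fast approximate similarity-based scheme (LSH hashing or online k-means).

```python
import numpy as np

def clustered_attention(Q, K, V, q_cluster, k_cluster):
    """Clustering-based sparse attention (simplified sketch).

    An attention weight is computed only when a query and a key fall in
    the same cluster; all other entries of the matrix are exactly 0."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)
    same = q_cluster[:, None] == k_cluster[None, :]   # (N, N) boolean mask
    scores = np.where(same, scores, -np.inf)
    A = np.exp(scores - scores.max(axis=-1, keepdims=True))
    A = A / A.sum(axis=-1, keepdims=True)
    return A, A @ V

rng = np.random.default_rng(1)
Q, K, V = rng.normal(size=(4, 3)), rng.normal(size=(4, 3)), rng.normal(size=(4, 3))
clusters = np.array([0, 0, 1, 1])   # toy assignment, shared by queries and keys
A, out = clustered_attention(Q, K, V, clusters, clusters)
print(A)  # block-diagonal: cross-cluster entries are exactly 0
```

With balanced clusters of size roughly N/c, only about N²/c score entries actually matter, which is the source of the speed-up.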
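The "reduce the number of keys" idea above can be sketched in the Linformer style: a learned projection compresses the N keys and values into k representative ones, each a linear combination of the originals. This is a sketch assuming an unbatched single head and a shared projection `E` for keys and values (Linformer learns separate per-layer projections).

```python
import numpy as np

def linformer_attention(Q, K, V, E):
    """Linformer-style low-rank attention (sketch).

    E: (k, N) learned projection producing k representative keys/values.
    The attention matrix shrinks from (N, N) to (N, k); the queries are
    not reduced, so the output sequence length stays N."""
    d = Q.shape[-1]
    K_r, V_r = E @ K, E @ V                  # (k, d): representative keys/values
    scores = Q @ K_r.T / np.sqrt(d)          # (N, k) instead of (N, N)
    A = np.exp(scores - scores.max(axis=-1, keepdims=True))
    A = A / A.sum(axis=-1, keepdims=True)
    return A @ V_r                           # (N, d): length unchanged

rng = np.random.default_rng(2)
N, k, d = 8, 3, 4
Q, K, V = rng.normal(size=(N, d)), rng.normal(size=(N, d)), rng.normal(size=(N, d))
E = rng.normal(size=(k, N))                  # random here; learned in Linformer
out = linformer_attention(Q, K, V, E)
print(out.shape)  # (8, 4) -- same length as the input
```

Compressed Attention follows the same shape logic but produces the representative keys with a strided convolution over the sequence instead of a dense projection.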