ATTENTION

tf.keras.layers.Attention: Dot-product attention layer, a.k.a. Luong-style attention.

tf.keras.layers.Attention( View source on GitHub ) Dot-product attention layer, a.k.a. Luong-style attention. Inherits From: Layer, Module tf.keras.la ......

attention Dot-product Luong-style Attention product更新时间 2024-01-06

Self-attention小小实践

目录公式 1 不带权重的自注意力机制公式 2 带权重的自注意力机制公式 1 不带权重的自注意力机制 \[Attention(X) = softmax(\frac{X\cdot{X^T}}{\sqrt{dim_X}})\cdot X \]示例程序： import numpy as np emb_di ......

Self-attention attention Self更新时间 2023-12-24

Is Attention Better Than Matrix Decomposition?

Is Attention Better Than Matrix Decomposition? * Authors: [[Zhengyang Geng]], [[Meng-Hao Guo]], [[Hongxu Chen]], [[Xia Li]], [[Ke Wei]], [[Zhouchen Li ......

Decomposition Attention Better Matrix Than更新时间 2023-12-18

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation * Authors: [[Meng-Hao Guo]], [[Cheng-Ze Lu]], [[Qibin Hou]], [[Zhengning ......

Convolutional Segmentation Rethinking Attention Semantic更新时间 2023-12-18

CCNet: Criss-Cross Attention for Semantic Segmentation

CCNet: Criss-Cross Attention for Semantic Segmentation * Authors: [[Zilong Huang]], [[Xinggang Wang]], [[Yunchao Wei]], [[Lichao Huang]], [[Humphrey S ......

Segmentation Criss-Cross Attention Semantic CCNet更新时间 2023-12-18

Dual Attention Network for Scene Segmentation：双线并行的注意力

Dual Attention Network for Scene Segmentation * Authors: [[Jun Fu]], [[Jing Liu]], [[Haijie Tian]], [[Yong Li]], [[Yongjun Bao]], [[Zhiwei Fang]], [[H ......

Segmentation 注意力 Attention Network Scene更新时间 2023-12-18

Attention Is All You Need

Attention Is All You Need * Authors: [[Ashish Vaswani]], [[Noam Shazeer]], [[Niki Parmar]], [[Jakob Uszkoreit]], [[Llion Jones]], [[Aidan N. Gomez]], ......

Attention Need All You Is更新时间 2023-12-18

Expectation-Maximization Attention Networks for Semantic Segmentation 使用了EM算法的注意力

Expectation-Maximization Attention Networks for Semantic Segmentation * Authors: [[Xia Li]], [[Zhisheng Zhong]], [[Jianlong Wu]], [[Yibo Yang]], [[Zho ......

Expectation-Maximization Maximization Segmentation 算法 Expectation更新时间 2023-12-18

CBAM: Convolutional Block Attention Module

CBAM: Convolutional Block Attention Module * Authors: [[Sanghyun Woo]], [[Jongchan Park]], [[Joon-Young Lee]], [[In So Kweon]] doi:https://doi.org/10. ......

Convolutional Attention Module Block CBAM更新时间 2023-12-18

PSANet: Point-wise Spatial Attention Network for Scene Parsing双向注意力

PSANet: Point-wise Spatial Attention Network for Scene Parsing * Authors: [[Hengshuang Zhao]], [[Yi Zhang]], [[Shu Liu]], [[Jianping Shi]], [[Chen Cha ......

双向注意力 Point-wise Attention Network更新时间 2023-12-18

Object Tracking Network Based on Deformable Attention Mechanism

Object Tracking Network Based on Deformable Attention Mechanism Local library 初读印象 comment:: （DeTrack）采用基于可变形注意力机制的编码器模块和基于自注意力机制的编码器模块相结合的方式进行特征交互。基于 ......

Deformable Attention Mechanism Tracking Network更新时间 2023-12-18

BiFormer: Vision Transformer with Bi-Level Routing Attention 使用超标记的轻量ViT

alias: Zhu2023a tags: 超标记注意力 rating: ⭐ share: false ptype: article BiFormer: Vision Transformer with Bi-Level Routing Attention * Authors: [[Lei Zhu] ......

轻量 Transformer 标记 Attention BiFormer更新时间 2023-12-18

A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation可变形注意力

A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation * Authors: [[Renxiang Zuo]], [[Guangyun Zhang]], [[Rong ......

High-Resolution Segmentation 注意力 Deformable Resolution更新时间 2023-12-18

GCGP：Global Context and Geometric Priors for Effective Non-Local Self-Attention加入了上下文信息和几何先验的注意力

Global Context and Geometric Priors for Effective Non-Local Self-Attention * Authors: [[Woo S]] 初读印象 comment:: （GCGP）提出了一个新的关系推理模块，它包含了一个上下文化的对角矩阵和二维相 ......

先验上下文 Self-Attention 几何注意力更新时间 2023-12-18

Flash-attention 2.3.2 支持 Windows了，但是我的2080ti是不支持的。

不久前Flash-attention 2.3.2 终于支持了 Windows，推荐直接使用大神编译好的whl安装 github.com/bdashore3/flash-attention/releasesstable diffusion webui flash-attention2性能测试安装环境 ......

Flash-attention attention Windows Flash 2080更新时间 2023-12-13

【论文解读】System 2 Attention提高大语言模型客观性和事实性

本文简要介绍了论文“System 2 Attention (is something you might need too) ”的相关工作。基于transformer的大语言模型（LLM）中的软注意很容易将上下文中的不相关信息合并到其潜在的表征中，这将对下一token的生成产生不利影响。为了帮助纠正... ......

事实性客观性 Attention 模型客观更新时间 2023-12-13

The Devil Is in the Details: Window-based Attention for Image Compression

目录简介简介基于CNN的模型的一个主要缺点是 cNN结构不是为捕捉局部冗余而设计的，尤其是非重复纹理，这严重影响了重建质量。受视觉转换器（ViT）和Swin Transformer最新进展的启发，我们发现将局部感知注意机制与全局相关特征学习相结合可以满足图像压缩的期望。介绍了一种更简单有效的基 ......

Window-based Compression Attention Details Window更新时间 2023-12-13

Attention 2015-今

现在attention的热度已经过去了，基本上所有的attention都是transformer的kqv形式的，甚至只要说道attention，默认就是transformer的attention。为避免遗忘历史，我这里做一个小总结。繁杂的att我就不去了解了，只了解下经典的。以下以\(h_i\) ......

Attention 2015更新时间 2023-12-11

System 2 Attention：可以提高不同LLM问题的推理能力

推理正在成为大型语言模型(llm)关注的下一个主要领域。尽管llm拥有先进的能力，但大多数llm经常被简单的错误绊倒，显示出他们在推理方面的局限性。这些模型可能会被上下文中的不相关细节所误导，或者受到输入提示中的偏差的影响。而后一种倾向被称为谄媚，也就是说模型会更偏向与输入一致，而不管准确性如何。人 ......

Attention 能力 System 问题 LLM更新时间 2023-12-10

Predicting gene expression from histone modifications with self-attention based neural networks and transfer learning

Predicting gene expression from histone modifications with self-attention based neural networks and transfer learning Yuchi Chen 1, Minzhu Xie 1, Jie ......

self-attention modifications Predicting expression attention更新时间 2023-12-08

论文：FEED-FORWARD NETWORKS WITH ATTENTION CAN SOLVE SOME LONG-TERM MEMORY PROBLEMS

题目：FEED-FORWARD NETWORKS WITH ATTENTION CAN SOLVE SOME LONG-TERM MEMORY PROBLEMS” (Raffel 和 Ellis, 2016, p. 1) “带有注意力的前馈网络可以解决一些长期记忆问题” (Raffel 和 Elli ......

FEED-FORWARD ATTENTION LONG-TERM NETWORKS PROBLEMS更新时间 2023-11-29

【略读论文|时序知识图谱补全】Tucker Decomposition with Frequency Attention for Temporal Knowledge Graph Completion

会议：ACL，时间：2023，学校：北京航空航天大学，多伦多大学关键词：基于张量分解；频率注意力；正则化摘要：之前基于张量分解的TKGC模型存在仅独立考虑一种关系与一个时间戳的组合，忽略了嵌入的全局性质的问题。本文的方法：一种频率注意力（FA）模型来捕获一个关系与整个时间戳之间的全局时间依赖 ......

时序图谱 Decomposition Completion Attention更新时间 2023-11-27

【略读论文|时序知识图谱补全】DREAM: Adaptive Reinforcement Learning based on Attention Mechanism for Temporal Knowledge Graph Reasoning

会议：SIGIR，时间：2023，学校：苏州大学计算机科学与技术学院，澳大利亚昆士兰布里斯班大学信息技术与电气工程学院，Griffith大学金海岸信息通信技术学院摘要：原因：现在的时序知识图谱推理方法无法生成显式推理路径，缺乏可解释性。方法迁移：由于强化学习 (RL) 用于传统知识图谱上的多跳 ......

时序图谱 Reinforcement Attention Knowledge更新时间 2023-11-21

共84篇 :1/3页 首页上一页123下一页尾页

526互联