
[Paper Reading] Self-conditioned Image Generation via Generating Representations

Pre title: Self-conditioned Image Generation via Generating Representations accepted: arXiv 2023 paper: https://arxiv.org/abs/2312.03701 code: https:/ ......

tf.keras.layers.Attention: Dot-product attention layer, a.k.a. Luong-style attention.

tf.keras.layers.Attention: Dot-product attention layer, a.k.a. Luong-style attention. Inherits From: Layer, Module. tf.keras.la ......
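A minimal usage sketch (my own, assuming TensorFlow 2.x; the tensor shapes are made up for illustration): the layer is called on [query, value] (key defaults to value) and returns the Luong-style dot-product attention output.

import tensorflow as tf

# Toy shapes: batch=2, Tq=3 query steps, Tv=5 value steps, feature dim=4
query = tf.random.normal((2, 3, 4))
value = tf.random.normal((2, 5, 4))

attn = tf.keras.layers.Attention()                     # dot-product (Luong-style) scores
output = attn([query, value])                          # shape (2, 3, 4)
output, scores = attn([query, value], return_attention_scores=True)
print(output.shape, scores.shape)                      # (2, 3, 4) (2, 3, 5)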

Self-attention: A Small Hands-on Exercise

Contents: Formula 1, self-attention without weights; Formula 2, self-attention with weights. Formula 1, self-attention without weights: \[Attention(X) = softmax(\frac{X\cdot{X^T}}{\sqrt{dim_X}})\cdot X \] Example program: import numpy as np emb_di ......
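The preview cuts the example program off, so here is a self-contained NumPy sketch of Formula 1 (the unweighted variant); the function name and array sizes are mine, not from the original post.

import numpy as np

def self_attention(X):
    # Formula 1: Attention(X) = softmax(X X^T / sqrt(dim_X)) X
    dim = X.shape[-1]
    scores = X @ X.T / np.sqrt(dim)                 # (n, n) pairwise similarities
    scores -= scores.max(axis=-1, keepdims=True)    # for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ X                              # (n, dim) re-weighted embeddings

X = np.random.randn(4, 8)          # 4 tokens with embedding dim 8 (illustrative)
print(self_attention(X).shape)     # (4, 8)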

[Paper Reading] Self-supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks

1. GARF Overview. Code: https://github.com/PJinfeng/Garf-master. Building on SeqGAN, the paper proposes GARF, a self-supervised, data-driven data-cleaning framework. GARF cleans data in two steps: Rule generation with SeqGAN: using ......

Is Attention Better Than Matrix Decomposition?

Is Attention Better Than Matrix Decomposition? * Authors: [[Zhengyang Geng]], [[Meng-Hao Guo]], [[Hongxu Chen]], [[Xia Li]], [[Ke Wei]], [[Zhouchen Li ......

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation * Authors: [[Meng-Hao Guo]], [[Cheng-Ze Lu]], [[Qibin Hou]], [[Zhengning ......

CCNet: Criss-Cross Attention for Semantic Segmentation

CCNet: Criss-Cross Attention for Semantic Segmentation * Authors: [[Zilong Huang]], [[Xinggang Wang]], [[Yunchao Wei]], [[Lichao Huang]], [[Humphrey S ......

Dual Attention Network for Scene Segmentation: two parallel attention branches

Dual Attention Network for Scene Segmentation * Authors: [[Jun Fu]], [[Jing Liu]], [[Haijie Tian]], [[Yong Li]], [[Yongjun Bao]], [[Zhiwei Fang]], [[H ......

Attention Is All You Need

Attention Is All You Need * Authors: [[Ashish Vaswani]], [[Noam Shazeer]], [[Niki Parmar]], [[Jakob Uszkoreit]], [[Llion Jones]], [[Aidan N. Gomez]], ......

Expectation-Maximization Attention Networks for Semantic Segmentation: attention built on the EM algorithm

Expectation-Maximization Attention Networks for Semantic Segmentation * Authors: [[Xia Li]], [[Zhisheng Zhong]], [[Jianlong Wu]], [[Yibo Yang]], [[Zho ......

CBAM: Convolutional Block Attention Module

CBAM: Convolutional Block Attention Module * Authors: [[Sanghyun Woo]], [[Jongchan Park]], [[Joon-Young Lee]], [[In So Kweon]] doi:https://doi.org/10. ......

PSANet: Point-wise Spatial Attention Network for Scene Parsing: bi-directional attention

PSANet: Point-wise Spatial Attention Network for Scene Parsing * Authors: [[Hengshuang Zhao]], [[Yi Zhang]], [[Shu Liu]], [[Jianping Shi]], [[Chen Cha ......

Object Tracking Network Based on Deformable Attention Mechanism

Object Tracking Network Based on Deformable Attention Mechanism Local library First impressions comment:: (DeTrack) combines an encoder module based on deformable attention with an encoder module based on self-attention for feature interaction. Based on ......

BiFormer: Vision Transformer with Bi-Level Routing Attention: a lightweight ViT using super tokens

alias: Zhu2023a tags: super-tokens attention rating: ⭐ share: false ptype: article BiFormer: Vision Transformer with Bi-Level Routing Attention * Authors: [[Lei Zhu] ......

A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation: deformable attention

A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation * Authors: [[Renxiang Zuo]], [[Guangyun Zhang]], [[Rong ......

GCGP: Global Context and Geometric Priors for Effective Non-Local Self-Attention: attention that incorporates context information and geometric priors

Global Context and Geometric Priors for Effective Non-Local Self-Attention * Authors: [[Woo S]] First impressions comment:: (GCGP) proposes a new relational reasoning module that contains a contextualized diagonal matrix and a two-dimensional ......

Fully Attentional Network for Semantic Segmentation: FLANet

Fully Attentional Network for Semantic Segmentation * Authors: [[Qi Song]], [[Jie Li]], [[Chenghong Li]], [[Hao Guo]], [[Rui Huang]] First impressions comment:: (F ......

Flash-attention 2.3.2 now supports Windows, but my 2080 Ti is not supported.

Not long ago Flash-attention 2.3.2 finally added Windows support; the recommended route is to install a prebuilt whl: github.com/bdashore3/flash-attention/releases. Stable Diffusion WebUI flash-attention2 performance test. Installation environment ......
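For reference, a minimal usage sketch of the FlashAttention-2 Python interface (my own, assuming the flash-attn package from such a whl is installed and an Ampere-or-newer GPU is available; as noted above, the Turing-based 2080 Ti is not supported):

import torch
from flash_attn import flash_attn_func   # provided by the flash-attn package

# q, k, v: (batch, seqlen, n_heads, head_dim), fp16/bf16, on a supported CUDA GPU
q = torch.randn(1, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

out = flash_attn_func(q, k, v, causal=True)   # same shape as q
print(out.shape)                              # torch.Size([1, 1024, 8, 64])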

[Paper Explained] System 2 Attention improves the objectivity and factuality of large language models

This post briefly introduces the paper "System 2 Attention (is something you might need too)". Soft attention in transformer-based large language models (LLMs) readily folds irrelevant information from the context into its latent representations, which hurts next-token generation. To help correct... ......

The Devil Is in the Details: Window-based Attention for Image Compression

Contents: Introduction. Introduction: a major drawback of CNN-based models is that the CNN structure is not designed to capture local redundancy, especially non-repetitive textures, which severely hurts reconstruction quality. Inspired by recent advances in Vision Transformers (ViT) and the Swin Transformer, we find that combining a locally-aware attention mechanism with learning of globally related features can meet the expectations of image compression. It introduces a simpler and more effective ......

[Paper Notes] Attributed Graph Clustering: A Deep Attentional Embedding Approach

[Paper Notes] Attributed Graph Clustering: A Deep Attentional Embedding Approach. Chinese title: 属性图聚类:一种深度注意力嵌入方法. Paper link: https://arxiv.org/abs/1906.06532. Background: graph clustering is about discovering network ......

Digging into Unity's Transform.Rotate(xAngle: float, yAngle: float, zAngle: float, relativeTo: Space = Space.Self)

public class demoword2 : MonoBehaviour { // Start is called before the first frame update void Start() { //transform.Rotate(60, 70, 80, Space.World); ......

Attention, 2015 to the present

The hype around attention has passed; essentially all attention nowadays takes the transformer's QKV form, and as soon as attention is mentioned, the transformer's attention is assumed by default. To avoid forgetting the history, I give a small summary here. I will not dig into the more convoluted attention variants, only the classic ones. In what follows, let \(h_i\) ......
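The preview stops right at the \(h_i\) notation, so as a rough sketch of one classic (pre-transformer, Luong dot-product style) pattern the post goes on to summarize, with \(h_i\) the encoder hidden states and \(s\) the current decoder state (both names assumed from that notation, not taken from the post):

import numpy as np

def classic_attention(H, s):
    # e_i = h_i . s ; alpha = softmax(e) ; context c = sum_i alpha_i h_i
    scores = H @ s
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ H

H = np.random.randn(6, 16)   # 6 hidden states h_i, dim 16 (illustrative)
s = np.random.randn(16)      # decoder state (illustrative)
print(classic_attention(H, s).shape)   # (16,)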

System 2 Attention: improving LLMs' reasoning ability on different kinds of problems

Reasoning is becoming the next major focus area for large language models (LLMs). Despite their advanced capabilities, most LLMs are often tripped up by simple mistakes, revealing their limitations in reasoning. They can be misled by irrelevant details in the context, or swayed by biases in the input prompt. The latter tendency is known as sycophancy: the model tends to agree with the input regardless of accuracy. ......
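The preview ends before the method itself, but the paper's recipe is a two-step prompting procedure: first have the LLM regenerate the context keeping only relevant, objective material, then answer from that regenerated context. A rough sketch, with a hypothetical ask_llm callable standing in for whatever LLM client is actually used:

def system2_attention(context, question, ask_llm):
    # Step 1: regenerate the context, dropping irrelevant or opinionated text.
    rewrite_prompt = (
        "Rewrite the following text, keeping only the parts that are relevant and "
        "objective for answering the question, and removing opinions.\n\n"
        "Text: " + context + "\n\nQuestion: " + question + "\n\nRewritten text:"
    )
    cleaned_context = ask_llm(rewrite_prompt)          # ask_llm is hypothetical

    # Step 2: answer using only the regenerated context.
    answer_prompt = "Context: " + cleaned_context + "\n\nQuestion: " + question + "\n\nAnswer:"
    return ask_llm(answer_prompt)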

Predicting gene expression from histone modifications with self-attention based neural networks and transfer learning

Predicting gene expression from histone modifications with self-attention based neural networks and transfer learning Yuchi Chen 1, Minzhu Xie 1, Jie ......

[Paper Skim] Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised Learning

Pre title: Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised Learning accepted: ICCV 2023 paper: https://arxiv.org/abs ......

Paper: FEED-FORWARD NETWORKS WITH ATTENTION CAN SOLVE SOME LONG-TERM MEMORY PROBLEMS

Title: "FEED-FORWARD NETWORKS WITH ATTENTION CAN SOLVE SOME LONG-TERM MEMORY PROBLEMS" (Raffel and Ellis, 2016, p. 1), that is, feed-forward networks with attention can solve some long-term memory problems (Raffel and Elli ......

CA-TCC: Self-supervised Contrastive Representation Learning for Semi-supervised Time-Series Classification (time series, temporal representations, temporal and contextual contrasting, contrastive learning, self-supervised learning, semi-supervised learning; an extended version of TS-TCC)

It is now 10:48 on November 27, 2023; I read this paper today. Paper: Self-supervised Contrastive Representation Learning for Semi-supervised Time-Series Classification. GitHub: https://g ......

An example of using self in Lua

local funcs = {} function funcs:callMe() local tt = self.name -- value is "hello" print(tt) end f = funcs["callMe"] -- call style 1: f({name = "hello"}) -- call style 2: loca ......

[Paper Skim | Temporal Knowledge Graph Completion] Tucker Decomposition with Frequency Attention for Temporal Knowledge Graph Completion

Venue: ACL; year: 2023; institutions: Beihang University, University of Toronto. Keywords: tensor decomposition based; frequency attention; regularization. Abstract: previous tensor-decomposition-based TKGC models consider only one relation combined with one timestamp independently, ignoring the global nature of the embeddings. This paper's method: a frequency attention (FA) model that captures the global temporal dependencies between a relation and the whole set of timestamps ......