recognition-vgg convolutional large-scale recognition

【五期李伟平】CCF-B(TFS'23)Consensus Reaching Process With Multiobjective Optimization for Large-Scale Group Decision Making With Cooperative Game

Peng Wu, Fengen Li, Jie Zhao, et al. Consensus Reaching Process With Multiobjective Optimization for Large-Scale Group Decision Making With Cooperativ ......

MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video

目录概符号说明MMGCN代码 Wei Y., Wang X., Nie L., He X., Hong R. and Chua T. MMGCN: Multi-modal graph convolution network for personalized recommendation of mic ......

Hierarchical Clustering-based Personalized Federated Learning for Robust and Fair Human Activity Recognition-2023

任务:人类活动识别任务Human Activity Recognition HAR 指标:系统准确性、公平性、鲁棒性、可扩展性 方法:1. 提出一个带有层次聚类(针对鲁棒性和公平的HAR)个性化的FL框架FedCHAR;通过聚类(利用用户之间的内在相似关系)提高模型性能的准确性、公平性、鲁棒性。 2 ......

Deep Residual Learning for Image Recognition:ResNet

Deep Residual Learning for Image Recognition * Authors: [[Kaiming He]], [[Xiangyu Zhang]], [[Shaoqing Ren]], [[Jian Sun]] DOI: 10.1109/CVPR.2016.90 初读 ......
Recognition Residual Learning ResNet Image

Local Relation Networks for Image Recognition: LRNet

Local Relation Networks for Image Recognition * Authors: [[Han Hu]], [[Zheng Zhang]], [[Zhenda Xie]], [[Stephen Lin]] DOI: 10.1109/ICCV.2019.00356 @in ......
Recognition Relation Networks Local Image

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation * Authors: [[Meng-Hao Guo]], [[Cheng-Ze Lu]], [[Qibin Hou]], [[Zhengning ......

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions 可变形卷积v3

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions * Authors: [[Wenhai Wang]], [[Jifeng Dai]], [[Zhe Chen]], [[Z ......

Bottleneck Transformers for Visual Recognition

Bottleneck Transformers for Visual Recognition * Authors: [[Aravind Srinivas]], [[Tsung-Yi Lin]], [[Niki Parmar]], [[Jonathon Shlens]], [[Pieter Abbee ......

Fully convolutional networks for semantic segmentation

Fully convolutional networks for semantic segmentation * Authors: [[Jonathan Long]], [[Evan Shelhamer]], [[Trevor Darrell]] DOI: 10.1109/CVPR.2015.729 ......

U-Net: Convolutional Networks for Biomedical Image Segmentation

U-Net: Convolutional Networks for Biomedical Image Segmentation * Authors: [[Olaf Ronneberger]], [[Philipp Fischer]], [[Thomas Brox]] Local library 初读 ......

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network * Authors: [[Wenzhe Shi]], [[Jose Caballer ......

CBAM: Convolutional Block Attention Module

CBAM: Convolutional Block Attention Module * Authors: [[Sanghyun Woo]], [[Jongchan Park]], [[Joon-Young Lee]], [[In So Kweon]] doi:https://doi.org/10. ......
Convolutional Attention Module Block CBAM

Scale-Prior Deformable Convolution for Exemplar-Guided Class-Agnostic Counting

Scale-Prior Deformable Convolution for Exemplar-Guided Class-Agnostic Counting 初读印象 comment:: (计数用的一个网络)提出了一个标度优先的可变形卷积,将典范的信息,例如标度,整合到计数网络主干中。 动机 本文考 ......

Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images

Adaptive Sparse Convolutional Networks with Global Context Enhancement for Faster Object Detection on Drone Images * Authors: [[Bowei Du]], [[Yecheng ......

Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition:使用大核卷积调制来简化注意力

Conv2Former: A Simple Transformer-Style ConvNet for Visual Recognition * Authors: [[Qibin Hou]], [[Cheng-Ze Lu]], [[Ming-Ming Cheng]], [[Jiashi Feng]] ......

【论文阅读笔记】【OCR-文本识别】 SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

SEED CVPR 2020 读论文思考的问题 论文试图解决什么问题?写作背景是什么? 问题: 如何利用全局的语义信息提高文本识别模型对低质量文本的鲁棒性和识别效果? 背景: 以往的基于 encoder-decoder 的文本识别方法通常基于局部的视觉特征解码出文本,忽略了对单词显式的全局语义信息的 ......

论文精读:STMGCN利用时空多图卷积网络进行移动边缘计算驱动船舶轨迹预测(STMGCN: Mobile Edge Computing-Empowered Vessel Trajectory Prediction Using Spatio-Temporal Multigraph Convolutional Network)

《STMGCN: Mobile Edge Computing-Empowered Vessel Trajectory Prediction Using Spatio-Temporal Multigraph Convolutional Network》 论文链接:https://doi.org/10. ......

论文精读:基于具有时空感知的稀疏多图卷积混合网络的大数据驱动船舶轨迹预测(Big data driven trajectory prediction based on sparse multi-graph convolutional hybrid network withspatio-temporal awareness)

论文精读:基于具有时空感知的稀疏多图卷积混合网络的大数据驱动船舶轨迹预测 《Big data driven vessel trajectory prediction based on sparse multi-graph convolutional hybrid network with spati ......

Exercise 2 - Handwriting Recognition

Exercise 2 - Handwriting Recognition 在课程中,您学习了如何使用Fashion MNIST 进行分类,这是一个包含服装项目的数据集。还有一个类似的数据集叫做 MNIST,其中包含手写项目--数字 0 到 9。 编写一个 MNIST 分类器,训练达到 99% 或以上 ......
Handwriting Recognition Exercise

Exercise 3 - Convolutions

Exercise 3 - Convolutions 在视频中,您了解了如何使用卷积来改进时尚 MNIST。在练习中,请看您能否仅使用一个卷积层和一个 MaxPooling 2D 将 MNIST 的准确率提高到 99.8% 或更高。一旦准确率超过这一水平,就应停止训练。这应该在 20 个历元以内完成, ......
Convolutions Exercise

【论文阅读笔记】【OCR-文本识别】 LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

LISTER ICCV 2023 读论文思考的问题 论文试图解决什么问题? 由于长尾效应和错误累积等原因,现有的文本识别模型对于长文本的识别能力较差 如何提高模型对于长度较长的文本的识别能力? 文章提出了什么样的解决方法? 提出了 LISTER 模型,引入了 neighbor matrix 的概念, ......

how convolutions work

how convolutions work 让我们在二维灰度图像上创建一个基本卷积,探索卷积是如何工作的。首先,我们可以从 scipy 中获取 "asccent "图像来加载图像。这是一张漂亮的内置图片,有很多角度和线条。 import cv2 import numpy as np from sci ......
convolutions work how

Improving Computer Vision Accuracy using Convolutions

Improving Computer Vision Accuracy using Convolutions ‍ 在前面的课程中,你们了解了如何使用包含三层的深度神经网络(DNN)进行时装识别,这三层分别是输入层(数据的形状)、输出层(所需输出的形状)和隐藏层。你试验了不同大小的隐藏层、训练epoch ......

[ABC315Ex] Typical Convolution Problem

题目链接 首先观察到这个形式,容易发现它和常规的卷积不同点就在于:题目给出的求和定义中,\(\sum\) 符号下面的式子是 \(i+j<N\) 求和而不是 \(i+j=N\)。 为了方便计算,我们引入: \[G_n=\sum_{i+j<N}F_iF_j \]我们发现,假设所有 \(F_{1\sim{ ......
Convolution Typical Problem ABC 315

Convolutional Neural Networks on Graphs with Chebyshev Approximation, Revisited

目录概符号说明MotivationChebNetII代码 He M., Wei Z. and Wen J. Convolutional neural networks on graphs with chebyshev approximation, revisited. NIPS, 2022. 概 作 ......

ZYNQ_Target-Recognition Project complete

ZYNQ_Target-Recognition 描述:实现了一个卷积神经网络加速器,成功搭载Yolov3tiny。配合摄像头采集+显示器回显环路,构建了一个高性能实时目标识别与检测系统。 实现方式: Verilog实现卷积加速器的设计, C语言实现Zynq PS端的开发, Python实现神经网络的 ......

MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation

论文名: MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation "MS-TCN++: 用于动作分割的多阶段时域卷积" Shi-Jie Li#, Yazan AbuFarha#, Yun Liu, Mi ......

【论文阅读笔记】【OCR-文本识别】 CLIPTER: Looking at the Bigger Picture in Scene Text Recognition

CLIPTER ICCV 2023 读论文思考的问题 论文试图解决什么问题? 现有的文本识别方法只关注于局部截取的文本区域,识别模型并没有利用全图的上下文信息,导致其可能对有挑战性的文本的识别效果较差 能否以某种方式使识别器利用上global feature的信息? 文章提出了什么样的解决方法? 提 ......
Recognition 文本 CLIPTER Looking Picture

【论文阅读笔记】【OCR-文本识别】 Scene Text Recognition with Permuted Autoregressive Sequence Models

PARSeq ECCV 2022 读论文思考的问题 论文试图解决什么问题? 一些文本识别模型会对 semantic 信息建模,从而辅助某些困难情况下的文本识别 传统的 auto-regressive 方式限制了语义信息的传输方向;双向的 auto-regressive 聚合增加了不必要的计算量和复杂 ......