inference computer learning vision

Deep Residual Learning for Image Recognition：ResNet

Deep Residual Learning for Image Recognition * Authors: [[Kaiming He]], [[Xiangyu Zhang]], [[Shaoqing Ren]], [[Jian Sun]] DOI: 10.1109/CVPR.2016.90 初读 ......

Recognition Residual Learning ResNet Image更新时间 2023-12-18

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions 可变形卷积v3

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions * Authors: [[Wenhai Wang]], [[Jifeng Dai]], [[Zhe Chen]], [[Z ......

卷积 Convolutions InternImage Large-Scale Deformable更新时间 2023-12-18

SegViT: Semantic Segmentation with Plain Vision Transformers

SegViT: Semantic Segmentation with Plain Vision Transformers * Authors: [[Bowen Zhang]], [[Zhi Tian]], [[Quan Tang]], [[Xiangxiang Chu]], [[Xiaolin We ......

Segmentation Transformers Semantic SegViT Vision更新时间 2023-12-18

BiFormer: Vision Transformer with Bi-Level Routing Attention 使用超标记的轻量ViT

alias: Zhu2023a tags: 超标记注意力 rating: ⭐ share: false ptype: article BiFormer: Vision Transformer with Bi-Level Routing Attention * Authors: [[Lei Zhu] ......

轻量 Transformer 标记 Attention BiFormer更新时间 2023-12-18

Learning to Rank — xgboost 2.0.2

* [Learning to Rank — xgboost 2.0.2 documentation](https://xgboost.readthedocs.io/en/stable/tutorials/learning_to_rank.html)* [XGBoost的原理、公式推导、Python实 ......

Learning xgboost Rank to更新时间 2023-12-18

A Guide to Image and Video based Small Object Detection using Deep Learning : Case Study of Maritime Surveillance

A Guide to Image and Video based Small Object Detection using Deep Learning : Case Study of Maritime Surveillance 基于图像和视频的小对象指南使用深度学习进行检测：的案例研究海上监视 1 ......

Surveillance Detection Learning Maritime Object更新时间 2023-12-18

Occluded Person Re-Identification with Deep Learning: A Survey and Perspectives

应对行人信息丢失、噪声干扰、视角错位等挑战局部特征学习：人体分割、姿态估计、语义分割、属性注释、混合法 ***语义分割*** ECCV-2020）用于行人重识别的身份引导人类语义解析 https://blog.51cto.com/u_14300986/5466923 ......

Re-Identification Identification Perspectives Occluded Learning更新时间 2023-12-18

Rethinking and Improving Relative Position Encoding for Vision Transformer: ViT中的位置编码

Rethinking and Improving Relative Position Encoding for Vision Transformer * Authors: [[Kan Wu]], [[Houwen Peng]], [[Minghao Chen]], [[Jianlong Fu]], ......

Transformer Rethinking Improving Encoding Relative更新时间 2023-12-18

神经网络优化篇：机器学习基础（Basic Recipe for Machine Learning）

机器学习基础下图就是在训练神经网络用到的基本方法：（尝试这些方法，可能有用，可能没用）这是在训练神经网络时用到地基本方法，初始模型训练完成后，首先要知道算法的偏差高不高，如果偏差较高，试着评估训练集或训练数据的性能。如果偏差的确很高，甚至无法拟合训练集，那么要做的就是选择一个新的网络，比如含有更 ......

神经网络 Learning 神经机器 Machine更新时间 2023-12-18

【scikit-learn基础】--『预处理』之正则化

数据的预处理是数据分析，或者机器学习训练前的重要步骤。通过数据预处理，可以提高数据质量，处理数据的缺失值、异常值和重复值等问题，增加数据的准确性和可靠性整合不同数据，数据的来源和结构可能多种多样，分析和训练前要整合成一个数据集提高数据性能，对数据的值进行变换，规约等（比如无量纲化），让算法更加 ......

正则 scikit-learn 基础 scikit learn更新时间 2023-12-18

Machine Learning in Python

Metric Formula Interpretation Accuracy $ \frac{TP+TN}{TP+TN+FP+FN} $ Overall performance of model Precision $ \frac{TP}{TP+FN} $ How accurate the posi ......

Learning Machine Python in更新时间 2023-12-18

MetaFormer Is Actually What You Need for Vision：通用的ViT架构才是关键

MetaFormer Is Actually What You Need for Vision * Authors: [[Weihao Yu]], [[Mi Luo]], [[Pan Zhou]], [[Chenyang Si]], [[Yichen Zhou]], [[Xinchao Wang]] ......

MetaFormer 架构 Actually 关键 Vision更新时间 2023-12-17

Ansor：Generating High-Performance Tensor Program for Deep Learning

Ansor：Generating High-Performance Tensor Program for Deep Learning Abstract 高性能的张量程序对于保证深度神经网络的高效执行十分关键，但是在不同硬件平台上获取高性能的张量程序并不容易。近年的研究中，深度学习系统依赖硬件供应商提 ......

High-Performance Performance Generating Learning Program更新时间 2023-12-17

Open-World Object Manipulation using Pre-trained Vision-Language Models

概述提出MOO: Manipulation of Open-World Objects 用预训练的VLM在图像中标记instruction的object的坐标，传入policy进行控制，可以zero-shot泛化到novel object，还支持手指、点击输入指令。问题机器人泛化到训练中没有见 ......

Vision-Language Manipulation Pre-trained Open-World Language更新时间 2023-12-17

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows详解

初读印象 comment:: (Swin-transformer)代码：https://github. com/microsoft/Swin-Transformer 动机将在nlp上主流的Transformer转换到cv上。存在以下困难： nlp中单词标记是一个基本单元，但是视觉元素在尺度上有很大 ......

Transformer Hierarchical Shifted Windows Vision更新时间 2023-12-17

N维空间长度算角度和长度的计算器下载 N Dims spaces compute angle and length kwlns 2024 download

算N维尺寸的长方体的对角线的长度和角度的软件。 A software that calculates the length and angle of the diagonal of a rectangular prism with N-dimensional dimensions. ......

长度维空间计算器 download 角度更新时间 2023-12-17

RLHF · PbRL | 选择 near on-policy query，加速 policy learning 收敛速度

Query-Policy Misalignment：选择的看似 informative 的 query，实际上可能与 RL agent 的兴趣不一致，因此对 policy learning 几乎没有帮助，最终导致 feedback-efficiency 低下。 ......

policy on-policy learning 速度 query更新时间 2023-12-17

ml.net例子笔记3-Infer.net概率机器学习库

Infer.net Infer.NET is a .NET Foundation project. It's also a part of ML.NET machine learning framework. https://dotnet.github.io/infer/ https://gitee ......

概率 net 例子机器笔记更新时间 2023-12-17

Q-learning与Sarsa算法辨析

这个是Q-learing的一个算法，根据代码，它就是，先设定训练100次，然后，给它一个随机的状态，这里我们假设状态6就是终点，那么走迷宫的时候，如果没走到6，就要一直走下去，，所以里面还要用到一个while循环，然后在每个状态的时候，找一个非负的动作，存储在数组里，（算是合理动作的集合吧），下一个 ......

算法 Q-learning learning Sarsa更新时间 2023-12-16

《MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models》论文学习

一、ABSTRACT 最新的GPT-4展示了非凡的多模态能力，例如直接从手写文本生成网站和识别图像中的幽默元素。这些特性在以往的视觉-语言模型中很少见。然而，GPT-4背后的技术细节仍然未公开。我们认为，GPT-4增强的多模态生成能力源自于复杂的大型语言模型（LLM）的使用。为了检验这一现象，我们 ......

Vision-language Understanding Enhancing Advanced Language更新时间 2023-12-16

Bioinformatics/ Computational Biology /biostats

Bioinformatics Computational Biology biostats 对于这两个专业，我们可以从应用领域来区分：●Biostatistics生物统计学的研究方向可分为两类：统计遗传学和临床统计学；课程中与生物相关的内容很少，更重视学生的量化能力。●而Bioinformatics ......

Bioinformatics Computational biostats Biology更新时间 2023-12-15

智能计算与图形图像处理Intelligent Computing and Graphics and Image Processing

智能算法 Intelligence Algorithms 图形图像处理 Graphics & Image Processing 机器视觉 machine vision 计算机视觉 computer vision 计算机视觉（computer vision），用计算机来模拟人的视觉机理获取和处理信息的 ......

图形图像 Intelligent Processing Computing and更新时间 2023-12-15

Inferring Developmental Trajectories and Causal Regulations with Single-cell Genomics用单细胞基因组学推断发育轨迹和因果规则

Inferring Developmental Trajectories and Causal Regulations with Single-cell Genomics Development is commonly regarded as a hierarchical branching pro ......

单细胞基因组因果 Developmental Trajectories更新时间 2023-12-15

ICCBDAI2021 第*届计算机大数据与人工智能国际会议 The 2nd International Conference on Computer, Big Data and Artificial Intelligence

第二届计算机大数据与人工智能国际会议（ICCBDAI2021） . 01 会议信息 . 【会议简称】：ICCBDAI2021 【会议全称】： The 2nd International Conference on Computer, Big Data and Artificial Intellige ......

人工智能国际会议 International Intelligence Conference更新时间 2023-12-15

《Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification》阅读笔记

论文标题《Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification》细粒度视觉分类中类别一致多粒度特征的渐进学习作者 Ruoyi D ......

Category-Consistent Multi-Granularity Classification Fine-Grained Progressive更新时间 2023-12-15

【scikit-learn基础】--『预处理』之数据缩放

scikit-learn 基础数据 scikit learn更新时间 2023-12-15

【论文阅读笔记】【多模态-Vision-Language Pretraining】 BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

BLIP ICML 2022 (Spotlight) 读论文思考的问题论文试图解决什么问题？写作背景是什么？问题：在视觉-语言预训练（VLP）中，如何更加高效地利用充斥着噪声的海量图文对数据，提升预训练效果？如何设计模型，使得预训练后的模型在理解（understanding-based）任务 ......

Language Vision-Language 模态 Vision Language-Image更新时间 2023-12-14

【scikit-learn基础】--『预处理』之标准化

scikit-learn 基础标准 scikit learn更新时间 2023-12-13

论文精读：STMGCN利用时空多图卷积网络进行移动边缘计算驱动船舶轨迹预测（STMGCN: Mobile Edge Computing-Empowered Vessel Trajectory Prediction Using Spatio-Temporal Multigraph Convolutional Network）

《STMGCN: Mobile Edge Computing-Empowered Vessel Trajectory Prediction Using Spatio-Temporal Multigraph Convolutional Network》论文链接：https://doi.org/10. ......

卷积 STMGCN Computing-Empowered Spatio-Temporal Convolutional更新时间 2023-12-12

[论文阅读] Replacing softmax with ReLU in Vision Transformers

Pre title: Replacing softmax with ReLU in Vision Transformers accepted: Arxiv 2023 paper: https://export.arxiv.org/abs/2309.08586 code: None 关键词：atten ......

Transformers Replacing softmax Vision 论文更新时间 2023-12-12

共689篇 :3/23页 首页上一页123456下一页尾页