
【Notes】Scene Recognition

Date: 2023-03-20 12:34:17
Tags: features, Notes, Scene, SIFT, they, local, image, Recognition

Paper 1: Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Bibliographic information:

Concept definition: SPM stands for Spatial Pyramid Matching. The resulting "spatial pyramid" is a simple and computationally efficient extension of an orderless bag-of-features image representation.

Research gap: Bag-of-features methods, which represent an image as an orderless collection of local features, have recently demonstrated impressive levels of performance. However, because these methods disregard all information about the spatial layout of the features, they have severely limited descriptive ability. In particular, they are incapable of capturing shape or of segmenting an object from its background.

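Target problem: comparing two sets of features, where the two sets contain different numbers of features and the features within each set are unordered.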
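One common way to compare two unordered feature sets of different sizes is to map every feature to a visual-word index, count the words, and compare the resulting histograms with histogram intersection. The sketch below is only an illustration under that assumption: the function names `word_histogram` and `histogram_intersection` and the toy word indices are hypothetical and do not come from the notes.

```python
from collections import Counter

def word_histogram(word_ids, vocab_size):
    """Count how often each visual word occurs; feature order is irrelevant."""
    counts = Counter(word_ids)
    return [counts.get(w, 0) for w in range(vocab_size)]

def histogram_intersection(h1, h2):
    """Number of matched features: the sum of bin-wise minima."""
    return sum(min(a, b) for a, b in zip(h1, h2))

# Two toy sets with different numbers of features (hypothetical word indices).
set_a = [0, 2, 2, 1, 2]   # 5 features
set_b = [2, 0, 2]         # 3 features

hist_a = word_histogram(set_a, vocab_size=3)
hist_b = word_histogram(set_b, vocab_size=3)
score = histogram_intersection(hist_a, hist_b)
```

Because both sets are reduced to fixed-length histograms, neither the number of features nor their order affects whether the comparison is well defined.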

Contribution: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence.

Method: This technique works by partitioning the image into increasingly fine sub-regions and computing histograms of local features found inside each sub-region.
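The partition-and-histogram idea can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes each local feature has already been quantized to a visual-word index, the function name `spatial_pyramid_histogram` is hypothetical, and the per-level weights (1/2^L for level 0, 1/2^(L-l+1) for level l ≥ 1) are my reading of the pyramid match weighting and should be checked against the paper.

```python
def spatial_pyramid_histogram(points, width, height, vocab_size, levels=2):
    """Concatenate weighted per-cell word histograms for levels 0..levels.

    points: iterable of (x, y, word_id) with 0 <= x < width, 0 <= y < height.
    At level l the image is split into 2^l x 2^l cells.
    """
    pyramid = []
    for level in range(levels + 1):
        cells = 2 ** level
        # Assumed weighting: finer levels count more than coarser ones.
        if level == 0:
            weight = 1.0 / 2 ** levels
        else:
            weight = 1.0 / 2 ** (levels - level + 1)
        hist = [0.0] * (cells * cells * vocab_size)
        for x, y, word in points:
            cx = min(int(x * cells / width), cells - 1)   # cell column
            cy = min(int(y * cells / height), cells - 1)  # cell row
            hist[(cy * cells + cx) * vocab_size + word] += weight
        pyramid.extend(hist)
    return pyramid

# Toy example: two features in opposite corners of a 100x100 image.
feat = spatial_pyramid_histogram(
    [(10, 10, 0), (90, 90, 1)], width=100, height=100, vocab_size=2, levels=1
)
```

With a vocabulary of M words and levels 0 to L, the resulting vector has M * (1 + 4 + ... + 4^L) entries, so two images can be compared with a single histogram intersection over the concatenated vectors.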

Research objective:

Authors' view:

Cited evidence:

Significance:

GitHub:

Other:

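Feature extraction: dense SIFT is typically used. A grid is laid over the image and a 128-dimensional SIFT descriptor is computed at the center of each grid cell, so the image is represented as a set of SIFT descriptors.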
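The grid-sampling step of dense SIFT can be sketched as below. This only generates the cell centers; computing the 128-dimensional descriptor at each center is left to a SIFT implementation (for example, OpenCV's `cv2.SIFT_create().compute` on `cv2.KeyPoint` objects, which is an assumption about tooling and not something the notes specify). The function name `dense_grid_centers` and the step size of 8 pixels are hypothetical.

```python
def dense_grid_centers(width, height, step=8):
    """Return (x, y) centers of the step x step grid cells that fit in the image."""
    half = step // 2
    return [
        (x + half, y + half)
        for y in range(0, height - step + 1, step)
        for x in range(0, width - step + 1, step)
    ]

# A 32x16 image with 8-pixel cells gives a 4x2 grid of sampling points.
centers = dense_grid_centers(32, 16, step=8)
```

Each center would then yield one descriptor, so an image with N grid cells is represented by N descriptors regardless of its content.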

Dictionary training: K-Means
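A bare-bones sketch of training a visual dictionary with K-Means is shown below, assuming the descriptors are plain numeric vectors. The helper names (`squared_distance`, `nearest`, `kmeans`), the fixed iteration count, and the 2-D toy data are hypothetical; real descriptors would be 128-dimensional and a library implementation would normally be preferred.

```python
import random

def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(point, centers):
    """Index of the closest center, i.e. the visual word for this descriptor."""
    return min(range(len(centers)), key=lambda i: squared_distance(point, centers[i]))

def kmeans(descriptors, k, iterations=20, seed=0):
    """Plain Lloyd iterations; the returned centers act as the visual vocabulary."""
    rng = random.Random(seed)
    centers = [list(d) for d in rng.sample(descriptors, k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for d in descriptors:
            clusters[nearest(d, centers)].append(d)
        for i, members in enumerate(clusters):
            if members:  # keep the old center if a cluster ends up empty
                dim = len(members[0])
                centers[i] = [sum(m[j] for m in members) / len(members) for j in range(dim)]
    return centers

# Toy 2-D "descriptors" forming two obvious groups.
toy = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
vocabulary = kmeans(toy, k=2)
words = [nearest(d, vocabulary) for d in toy]
```

After training, every descriptor in an image is replaced by the index of its nearest center, which is what turns a set of descriptors into a histogram of visual words.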

From: https://www.cnblogs.com/zhaoke271828/p/17235815.html
