|
TuPSBT1 |
Main Hall |
Poster Shotgun (05): PR |
Regular Session |
|
14:00-14:30, Paper TuPSBT1.1 | |
Emotion Recognition Modeling of Sitting Postures by Using Pressure Sensors and Accelerometers |
Shibata, Tatsuya | Tokyo Denki Univ. |
Kijima, Yohei | Tokyo Denki Univ. |
Keywords: Gesture and Behavior Analysis
Abstract: Not only facial expressions but also body gestures and postures play an important role in non-verbal communication. Facial expressions are based on two factors: arousal and pleasant emotions, while it is not clear that body gestures and postures have the same structure as the facial expressions have. We indicate that (1) the sitting postures have the same emotion structure as facial expressions and (2) can be measured by pressure sensors on a chair and accelerometers on the body, which predict the emotion factors. We find the sitting postures have the semantic factors: "arousal", "pleasantness", and "avoidance", so emotion expressions of the sitting postures are similar to those of the facial expressions. Their difference is "avoidance" expressed by not the main body but the body parts such as arms and legs. We conclude that (1) "arousal" and "pleasantness" factors can be measured with the proposed sensors and (2) the body trunk and the body parts: neck, arms, and legs are important.
|
|
14:00-14:30, Paper TuPSBT1.2 | |
A Combined Method for Finding Best Starting Points for Optimisation in Bernoulli Mixture Models |
Frouzesh, Faezeh | Victoria Univ. of Wellington |
Pledger, Shirley | Victoria Univ. of Wellington |
Hirose, Yuichi | Victoria Univ. of Wellington |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Classification and Clustering
Abstract: Mixture models are frequently used to classify data. They are likelihood based models, and the maximum likelihood estimates of parameters are often obtained using the expectation maximization (EM) algorithm. However, multimodality of the likelihood surface means that poorly chosen starting points for optimisation may lead to only a local maximum, not a global maximum. In this paper, different methods of choosing starting points will be evaluated and compared, mainly via simulated data and then we will introduce a procedure which will make intelligent choices of possible starting points and fast evaluations of their usefulness.
|
|
14:00-14:30, Paper TuPSBT1.3 | |
Road Marking Recognition for Map Generation Using Sparse Tensor Voting |
Ishida, Hiroyuki | TOYOTA Central R&D Lab. Inc. |
Kidono, Kiyosumi | TOYOTA Central R&D Lab. Inc. |
Kojima, Yoshiko | TOYOTA Central R&D Lab. Inc |
Naito, Takashi | TOYOTA Central R&D Lab. Inc. |
Keywords: Classification and Clustering, Camera-Based Document Analysis, Image and Video Understanding
Abstract: A recognition method of road markings for map generation is presented. For accurate position estimation and classification, two voting schemes are proposed and combined. The first is multi-frame sparse tensor voting for geometric feature extraction, and the second is contour localization using the resulting tensor field. Classification is based on the similarity between the aligned contour and the tensor field. The experimental results show that the proposed method outperforms conventional matching-based approaches.
|
|
14:00-14:30, Paper TuPSBT1.4 | |
Efficient Feature Selection for Linear Discriminant Analysis and Its Application to Face Recognition |
Lei, Zhen | Inst. of Automation, Chinese Acad. of Sciences |
Liao, Shengcai | Inst. of Automation, Chinese Acad. of Sciences |
Li, Stan Z. | -CASIA |
Keywords: Biometrics, Feature Reduction and Manifold Learning
Abstract: Feature selection is an important issue in pattern recognition. In face recognition, one of the state-of-the-art methods is that some feature selection methods (e.g., AdaBoost) are first utilized to select the most discriminative features and then the subspace learning methods (e.g., LDA) are further applied to learn the discriminant subspace for classification. However, in these methods, the objective of feature selection and subspace learning is not so consistent and the combination is not the optimal. In this paper, we propose a novel and efficient feature selection method that is designed for linear discriminant analysis (LDA). We use the Fisher criterion to select the most discriminative and appropriate features so that the objectives of feature selection and classifier learning are consistent (both follow the Fisher criterion) and the face recognition performance is expected to be improved. Experiments on FRGC v2.0 face database validate the efficacy of the proposed method.
|
|
14:00-14:30, Paper TuPSBT1.5 | |
PCA Feature Extraction for Change Detection in Multidimensional Unlabelled Streaming Data |
Kuncheva, Ludmila I. | Bangor Univ. |
Faithfull, William J | Bangor Univ. |
Keywords: Machine Learning and Data Mining, Classification and Clustering
Abstract: While there is a lot of research on change detection based on the streaming classification error, finding changes in multidimensional unlabelled streaming data is still a challenge. Here we propose to apply principal component analysis (PCA) to the training data, and mine the stream of selected principal components for change in the distribution. A recently proposed semi-parametric log-likelihood change detector (SPLL) is applied to the raw and the PCA streams in an experiment involving 26 data sets and an artificially induced change. The results show that feature extraction prior to the change detection is beneficial across different data set types, and specifically for data with multiple balanced classes.
|
|
14:00-14:30, Paper TuPSBT1.6 | |
Driving Support Based on Behavior Detection of Vehicle |
Minoura, Kazuma | Nagoya Univ. |
Watanabe, Toyohide | Nagoya Univ. |
Keywords: Gesture and Behavior Analysis, Pattern Recognition for Surveillance and Security
Abstract: Many accidents at intersection are happend due to judgment errors, which are caused from blind spot of drivers. We propose a method for generating warning messages related to the estimated accidents between blind spots of drivers and vehicle behaviors by using surveillance camera and on-board camera frames. In this method, vehicle behaviors of straight, right turn, left turn, change to right lane, and change to left lane are detected by applying hidden markov model (HMM). Direction sequences of behaviors are used for training HMMs which represent each behavior by Baum-Welch algorithm. Behavior information enables the drivers to recieve our intelligent support and allows drivers to recognize behaviors of other vehicles easily. From a result of behavior detection, our system could recognize 80% of behavior correctly.
|
|
14:00-14:30, Paper TuPSBT1.7 | |
Audio-Visual Emotion Recognition with Boosted Coupled HMM |
Lu, Kun | Beijing Inst. of Tech. |
Jia, Yunde | Beijing Inst. of Tech. |
Keywords: Gesture and Behavior Analysis, Classification and Clustering, Human Computer Interaction
Abstract: This paper presents a novel approach for automatic audio-visual emotion recognition. The audio and visual channels provide complementary information for human emotional states recognition, and we utilize coupled HMM (CHMM) as model-level fusion method in our work. To further improve recognition accuracy, we design an AdaBoost-CHMM ensemble classifier which takes CHMM as component classifiers in adaptive boosting procedure. A modified expectation-maximization (EM) algorithm for CHMM learning is proposed to make the learning process focus more on difficult samples. Experiment results on audio-visual emotion data collected in Wizard of Oz scenarios and labeled under two types of emotion category sets demonstrate that our approach is effective and promising.
|
|
14:00-14:30, Paper TuPSBT1.8 | |
Realtime Object Matching with Robust Dominant Orientation Templates |
Chaoqun, Hong | Xiamen Univ. of Tech. |
Jianke, Zhu | Zhejiang Univ. |
Song, Mingli | Zhejiang Univ. |
Wang, Yinting | Zhejiang Univ. |
Keywords: Classification and Clustering, Occlusion and Shadow Detection, Scene Understanding
Abstract: Most of conventional object matching methods are based on comparing the local features, which are too computational demanding to achieve realtime performance on object detection in videos. Recently, Dominant Orientation Templates (DOT) method was proposed to make online feature detection and comparison feasible. However, it still suffers the problem of fragility due to the noise and partial occlusions. To efficiently tackle these problems, we introduce the similarity map to store the matching scores of individual grids in each sliding window, which is used to further denoise and infer the occlusion map. The promising experimental results demonstrate that proposed approach improves the robustness, which outperforms DOT assuredly.
|
|
14:00-14:30, Paper TuPSBT1.9 | |
Does One Rotten Apple Spoil the Whole Barrel? |
Cheplygina, Veronika | Delft Univ. of Tech. |
Tax, David | Delft Univ. of Tech. |
Loog, Marco | Delft Univ. of Tech. / Univ. of Copenhagen |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Machine Learning and Data Mining, Classification and Clustering
Abstract: Multiple Instance Learning (MIL) is concerned with learning from sets (bags) of objects (instances), where the individual instance labels are ambiguous. In MIL it is often assumed that positive bags contain at least one instance from a so-called concept in instance space. However, there are many MIL problems that do not fit this formulation well, and hence cause traditional MIL algorithms, which focus on the concept, to perform poorly. In this work we show such types of problems and the methods appropriate to deal with either situation. Furthermore, we show that an approach that learns directly from dissimilarities between bags can be adapted to deal with either problem.
|
|
14:00-14:30, Paper TuPSBT1.10 | |
Learning Weighted Features for Human Action Recognition |
Zhou, Wen | Inst. of automation, Chinese Acad. of sciences |
Wang, Chunheng | Inst. of Automation Chinese Acad. of Sciences |
Xiao, Baihua | Inst. of Automation, Chinese Acad. of Sciences |
Zhang, Zhong | Inst. of Automation, Chinese Acad. of Sciences |
Ma, Long | Inst. of Automation, Chinese Acad. of Sciences, China |
Keywords: Gesture and Behavior Analysis, Motion, Tracking and Video Analysis
Abstract: In traditional bag-of-words method, each local feature is treated evenly for representation. One disadvantage of this kind of strategy is that it is not robust to noise, which makes the performance impaired. In this paper, a novel human action recognition approach which learns weights for features is proposed, where each feature is assigned a weight for human action representation. In our proposed strategy, the weights and the discriminative model are learned jointly. There are two advantages of our model. First, small weights are assigned to noise, which can help to reduce the effect of noise in the representation process. Second, discriminative features, which are critical for human action recognition, are assigned large weights. Experimental results demonstrate the advantages of our method.
|
|
14:00-14:30, Paper TuPSBT1.11 | |
Topological Features and Iterative Node Elimination for Speeding up Subgraph Isomorphism Detection |
Dahm, Nicholas | National ICT Australia |
Bunke, Horst | Univ. of Bern |
Caelli, Terry | National ICT Australia |
Gao, Yongsheng | Griffith Univ. |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Features and Image Descriptors, Pattern Recognition for Search, Retrieval and Visualization
Abstract: In this paper we tackle the problem of subgraph isomorphism detection on large graphs, which may commonly be intractable, even with state of the art algorithms. Rather than competing with other matching algorithms, we define enhancements that can be used by (almost) any subgraph isomorphism algorithm, both current and future. These enhancements consist of a number of topological features to be added to the nodes, and a technique which we term "iterative node elimination". The fusion of these enhancements is shown to be able to reduce subgraph isomorphism matching times by a factor of over 4,500.
|
|
14:00-14:30, Paper TuPSBT1.12 | |
Active Transfer Learning for Multi-View Head-Pose Classification |
Yan, Yan | Univ. of Trento |
Subramanian, Ramanathan | Univ. of Trento |
Lanz, Oswald | Fondazione Bruno Kessler - irst |
Sebe, Nicu | Univ. of Trento |
Keywords: Machine Learning and Data Mining
Abstract: This paper describes an textbf{active transfer learning} technique for multi-view head-pose classification. We combine transfer learning with active learning, where an active learner asks the domain expert to label the few textbf{most informative} target samples for transfer learning. Employing adaptive multiple-kernel learning (AMKL) for head-pose classification from four low-resolution views, we show how active sampling enables more efficient learning with few examples. Experimental results confirm that active transfer learning produces 10% higher pose-classification accuracy over several competing transfer learning approaches.
|
|
14:00-14:30, Paper TuPSBT1.13 | |
Modeling and Identification of Group Motion Via Compound Evaluation of Positional and Directional Cues |
Yucel, Zeynep | ATR |
Miyashita, Takahiro | ATR |
Hagita, Norihiro | ATR |
Keywords: Pattern Recognition for Surveillance and Security, Motion, Tracking and Video Analysis, Scene Understanding
Abstract: This paper addresses the problem of identification of pedestrian groups in crowded environments. To that end, positional and directional relations are modeled accounting for different environmental features and group configurations. Subsequently, a pair of simultaneously observed pedestrians is identified to belong to the same group or not utilizing these models in a parallel manner, which defines a compound hypothesis testing scheme. In case of ambiguities, local and global indicators of group relation are employed in quantifying the reliabilities of the two individual decisions. The contribution of this study lies in the improvement in positional and directional relation models to adjust to different environments and group configurations, description of compound evaluation, as well as resolution of ambiguities proposing uncertainty measures based on local and global indicators of group relation.
|
|
14:00-14:30, Paper TuPSBT1.14 | |
Nonparametric Online Background Generation for Surveillance Video |
Zhang, Rui | Queen's Univ. Canada |
Gong, Weiguo | Chongqing Univ. China |
Yaworski, Andrew | Queen's Univ. |
Greenspan, Michael | Queen's Univ. |
Keywords: Pattern Recognition for Surveillance and Security, Motion, Tracking and Video Analysis, Segmentation, Color and Texture
Abstract: A novel two-stage background generation method is proposed. In the first stage, an intensity-level based statistical approach is employed to identify a variety of background variations. No background training is needed. In the second stage, a background variation based heuristic framework is designed to generate a synchronized background video sequence on-line from the surveillance video. This framework employs both static and dynamic spatial clues in the scene. The quantitative evaluations demonstrate the method can generate more realistic background images than some well-known background modeling techniques.
|
|
14:00-14:30, Paper TuPSBT1.15 | |
Dynamical Ensemble Learning with Model Friendly Classification |
Tu, Wenting | East China Normal Univ. |
Sun, Shiliang | East China Normal Univ. |
Keywords: Machine Learning and Data Mining, Classification and Clustering, Pattern Recognition for Bioinformatics
Abstract: In the domain adaptation research, which recently becomes one of the most important research directions in machine learning, source and target domains are with different underlying distributions. In this paper, we propose an ensemble learning framework for domain adaptation. Owing to the distribution differences between source and target domains, the weights in the final model are sensitive to target examples. As a result, our method aims to dynamically assign weights to different test examples by making use of additional classifiers called model-friendly classifiers. The model-friendly classifiers can judge which base models predict well on a specific test example. Finally, the model can give the most favorable weights to different examples. In the experiments, we firstly testify the need of dynamical weights in the ensemble learning based domain adaptation, then compare our method with other classical methods on real datasets. The experimental results show that our method can learn a final model performing well in the target domain.
|
|
14:00-14:30, Paper TuPSBT1.16 | |
Sliced Inverse Regression with Conditional Entropy Minimization |
Hino, Hideitsu | Waseda Univ. |
Wakayama, Keigo | Waseda Univ. |
Murata, Noboru | Waseda Univ. |
Keywords: Feature Reduction and Manifold Learning, Statistical, Syntactic and Structural Pattern Recognition, Neural Networks
Abstract: An appropriate dimension reduction of raw data helps to reduce computational time and to reveal the intrinsic structure of complex data. In this paper, a dimension reduction method for regression is proposed. The method is based on the well-known sliced inverse regression and conditional entropy minimization. Using entropy as a measure of dispersion of data distribution, dimension reduction subspace is estimated without assuming regression function form nor data distribution, unlike conventional sliced inverse regression. The proposed method is shown to perform well compared to some conventional methods through experiments using both artificial and real-world data sets.
|
|
14:00-14:30, Paper TuPSBT1.17 | |
An Improved Entropy-Based Multiple Kernel Learning |
Hino, Hideitsu | Waseda Univ. |
Ogawa, Tetsuji | Waseda Univ. |
Keywords: Machine Learning and Data Mining, Statistical, Syntactic and Structural Pattern Recognition, Classification and Clustering
Abstract: Kernel methods have been successfully used in many practical machine learning problems. However, the problem of choosing a suitable kernel is left to practitioners. One method to select the optimal kernel is to learn a linear combination of element kernels. A framework of multiple kernel learning based on conditional entropy minimization criterion (MCEM) has been proposed and it has been shown to work well for, e.g., speaker recognition tasks. In this paper, a computationally efficient implementation for MCEM, which utilizes sequential quadratic programming, is formulated. Through a comparative experiment to conventional MCEM algorithm on a speaker verification task, the proposed method is shown to offer comparable verification accuracy with considerable improvement in computational speed.
|
|
14:00-14:30, Paper TuPSBT1.18 | |
Transfer Heterogeneous Unlabeled Data for Unsupervised Clustering |
Kong, Shu | Zhejiang Univ. |
Wang, Donghui | Zhejiang Univ. |
Keywords: Machine Learning and Data Mining, Classification and Clustering, Pattern Recognition for Search, Retrieval and Visualization
Abstract: In this paper, we propose a novel method called textbf{THUNTER} to textbf{t}ransfer the textbf{h}eterogenous textbf{un}labeled data in the source domain to the target domain for clustextbf{ter}ing. Suppose the target data are a set of images, then the so-called heterogeneous unlabeled data can be a large set of text data or acoustic data. Our method aims to address how to transfer these large amount of heterogeneous data to the relatively smaller set of target data for clustering. To the best of our knowledge, it is the first work in the community to transfer the unlabeled data, especially the unlabeled heterogeneous data, for unsupervised clustering. Furthermore, along with our method, a novel textbf{dic}tionary-based data textbf{trans}fer strategy (textbf{DicTrans}) is introduced in this paper, which measures the fidelity of transferring the data in the source domain and automatically decides how many to transfer. Through a series of experiments, the effectiveness of THUNTER and DicTrans are demonstrated with very promising performances.
|
|
14:00-14:30, Paper TuPSBT1.19 | |
Effectively Localize Text in Natural Scene Images |
Liu, Xiaoqian | Graduate Univ. of Chinese Acad. of Sciences |
Lu, Ke | Graduate Univ. of Chinese Acad. of Sciences |
Wang, Weiqiang | Graduate Univ. of Chinese Acad. of Sciences |
Keywords: Pattern Recognition for Search, Retrieval and Visualization, Image and Video Processing, Character and Text Recognition
Abstract: In this paper, we present an effective approach to locate scene text in images based on connected components analysis (CCA). Our approach first utilizes a multi-scale adaptive local thresholding operator to convert an image into two complementary binary images. Then, connected components (CCs) are extracted from both of them, which ensures that bright or dark text in contrast to background can be detected. Further, some rules are designed based on stroke features to verify whether a connected component belongs to characters, and the obtained candidate components are further checked on the word level by using a graph to represent spatial relation of different components. Finally, scene text regions are localized by searching the collinear maximum group over the graph. The comparison experiments of the proposed method with some representative state-of-the-art methods, on the challenging subset of ICDAR 2003, show that the proposed approach is very effective, and it is robust to text of different sizes, fonts,colors, as well as orientation of text lines.
|
|
14:00-14:30, Paper TuPSBT1.20 | |
Wonder Ears: Identification of Identical Twins from Ear Images |
Nejati, Hossein | National Univ. of Singapore, School ofComputing,Department |
Zhang, Li | National Univ. of Singapore |
Sim, Terence | National Univ. of Singapore |
Martinez-Marroquin, Elisa | La Salle School of Engineering � Ramon Llull Univ. |
Dong, Guo | Facebook Inc. |
Keywords: Biometrics, Pattern Recognition for Surveillance and Security, Image and Video Processing
Abstract: While identical twins identification is a well known challenge in face recognition, it seems that no work has explored automatic ear recognition for identical twin identification. Ear image recognition has been studied for years, but Iannarelli (1989) appears to be the only work mentioning the twin identification (performed manually). We here explore the possibility of automatic twin identification from their ear images based on a recently proposed psychological model for face recognition in humans, known as Exception Report Model (ERM). We test our new approach on 39 pairs of identical twins (78 subjects), with several levels of resolution, occlusion and noise, left ear vs. right ear, and feature optimization which verifies the robustness of the introduced features.
|
|
14:00-14:30, Paper TuPSBT1.21 | |
Unsupervised Discriminative Feature Selection in a Kernel Space Via L2, 1-Norm Minimization |
Liu, Yang | Peking Univ. |
Wang, Yizhou | National Engineering Lab. for Video Tech. Key Lab. of Machi |
Keywords: Feature Reduction and Manifold Learning, Classification and Clustering, Machine Learning and Data Mining
Abstract: Traditional nonlinear feature selection methods map the data from an original space into a kernel space to make the data be separated more easily, then move back to the original feature space to select features. However, the performance of clustering or classification is better in the kernel space, so we are able to select the features directly in the kernel space and get the direct importance of each feature. Motivated by this idea, we propose a novel method for unsupervised feature selection directly in the kernel space. To do this, we utilize local discriminative information to find the best label for each instance with L2,1-norm minimization, then select the most important features in the kernel space using the labels predicted. Extensive experiments demonstrate the effectiveness of our method.
|
|
14:00-14:30, Paper TuPSBT1.22 | |
Model-Based Feature Refinement by Ellipsoidal Face Tracking |
Jung, Sung-Uk | ETRI |
Nixon, Mark | Univ. of Southampton |
Keywords: Biometrics, Motion, Tracking and Video Analysis, Gesture and Behavior Analysis
Abstract: We describe a new method to relieve common assumptions/ restrictions in head tracking by using a model-based approach. This improves local feature matching which only considers the pattern around the extracted feature excluding the object shape, so that misalignment can occur. In this paper, to overcome constraints on motion we consider region- and distance-based feature refinement methods to validate the local features used when tracking the ellipsoidal object. We also present a direct mapping method to reconstruct 3D feature positions for tracking. The utility of the new method has been demonstrated for face pose estimation using the Boston face database.
|
|
14:00-14:30, Paper TuPSBT1.23 | |
Discriminative Metric: Schatten Norm vs. Vector Norm |
Gu, Zhenghong | StateUniversity of New York (SUNY) at Buffalo |
Shao, Ming | Univ. at Buffalo |
Li, Liangyue | Univ. at Buffalo |
Fu, Yun | SUNY at Buffalo |
Keywords: Feature Reduction and Manifold Learning, Biometrics, Machine Learning and Data Mining
Abstract: The notion of metric is fundamental for the study of pattern recognition and vector 2-norm |cdot|_{2} is one of the most widely used metric, i.e., Euclidean distance. However, there is often the case that the inputs are matrices, e.g., 2D images in face recognition. Since a matrix can take more structure information than its vectorization, it is highly preferable to adopt the matrix representation of the original image rather than a simple vector. In this paper, we first propose a class of discriminative metrics for matrices, i.e., Schatten p-norm, by which we can better explain that with Euclidean metric, why the differences among facial images due to impact factors, e.g., illuminations, are more significant than differences due to identity variations. Second, we propose a novel Principal Component Analysis method based on Schatten 1-norm which can be easily extended to other subspace learning methods. Extensive experiments on Yale B, CMU PIE, ORL and AR databases prove the effectiveness of our method.
|
|
14:00-14:30, Paper TuPSBT1.24 | |
Localization and Extraction of the Four Clock-Digits Using the Knowledge of the Digital Video Clock |
Yu, Xinguo | The Central China Normal Univ. |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Pattern Recognition for Search, Retrieval and Visualization
Abstract: This paper presents a novel algorithm for localizing and extracting the four clock-digits by using the knowledge of the digital video clock. This algorithm has triple merits compared with the existing ones. First, it is novel and easy implementation. It uses the pixel second-periodicity method. By using this method the algorithm omits the several tedious image processing steps. Second, it is faster than the existing algorithms. Third, it has good performance in both localization and extraction. The good performance is due to that it uses the domain knowledge in acquiring the digits color and in modeling the clock-digits layout. Experimental results show that our algorithm achieves an accuracy of 100% in localizing the four clock-digits at a very low computing cost and extracts the four clock-digits in a high accuracy.
|
|
14:00-14:30, Paper TuPSBT1.25 | |
Insect Species Recognition Using Discriminative Local Soft Coding |
Lu, An | Inst. of Automation, Chinese Acad. of Sciences |
Liu, Cheng-Lin | Inst. of Automation, Chinese Acad. of Sciences |
Hou, Xinwen | Inst. of Automation, Chinese Acad. of Sciences |
Chen, XiaoLin | Inst. of Zoology, Chinese Acad. of Sciences |
Keywords: Classification and Clustering, Image and Video Understanding
Abstract: Insect species recognition is more difficult than generic object recognition because of the similarity between different species. In this paper, we propose a hybrid approach called discriminative local soft coding (DLSoft) which combines local and discriminative coding strategies together. Our method takes use of neighbor codewords to get a local soft coding and class specific codebooks (sets of codewords) for a discriminative representation. On obtaining the vector representation of image via spatial pyramid pooling of patches, a linear SVM classifier is used to classify images into species. Experimental results show that the proposed method performs well on insect species recognition and outperforms the state-of-the-art methods on generic object categorization.
|
|
14:00-14:30, Paper TuPSBT1.26 | |
Advanced Ridge Flux Analysis for Fingerprint Minutiae Detection |
Ohtsuka, Tomohiko | Tokyo National Coll. of Tech. |
Keywords: Biometrics, Statistical, Syntactic and Structural Pattern Recognition, Detection, Separation and Segmentation
Abstract: This paper presents new fingerprint minutiae detection by the advanced ridge flux analysis. The considerable processing time taken by the conventional approaches, most of which use the ridge thinning process with a rather large calculation time, is a problem that has recently attracted increased attention. Though Ridge flux analysis method without using thinning process is proposed in order to reduce the computational time, there still remains a problem with low detection accuracy. The proposed approach is applied to detect minutiae by analyzing the curvature of ridge contours to achieve both of the computation time reduction and higher detection accuracy. The experimental results show that the proposed approach can achieve a reduction in calculation time, while achieving the same success detection rate as that of the conventional approaches.
|
|
14:00-14:30, Paper TuPSBT1.27 | |
Fast Approximated Relational and Kernel Clustering |
Schleif, Frank-Michael | Univ. of Bielefeld |
Zhu, Xibin | Univ. of Bielefeld |
Gisbrecht, Andrej | Univ. of Bielefeld |
Hammer, Barbara | Univ. of Bielefeld |
Keywords: Machine Learning and Data Mining, Classification and Clustering
Abstract: The large amount of digital data requests for scalable tools like efficient clustering algorithms. Many algorithms for large data sets request linear separability in an Euclidean space. Kernel approaches can capture the non-linear structure but do not scale well for large data sets. Alternatively, data are often represented implicitly by dissimilarities like for protein sequences, whose methods also often do not scale to large problems. We propose a single algorithm for both type of data, based on a batch approximation of relational soft competitive learning, termed fast generic soft-competitive learning. The algorithm has linear computational and memory requirements and performs favorable to traditional techniques
|
|
14:00-14:30, Paper TuPSBT1.28 | |
Speeding up Optimum-Path Forest Training by Path-Cost Propagation |
Iwashita, Adriana | Sao Paulo State Univ. |
Papa, Joao Paulo | Sao Paulo State Univ. - UNESP |
Falcao, Alexandre Xavier | State Univ. of Campinas |
Lotufo, Roberto | Univ. of Campinas |
Oliveira, Victor Matheus | Univ. of Campinas |
Albuquerque, Victor Hugo Costa | Univ. of Fortaleza |
Tavares, Joao Manuel R. S. | Faculdade de Engenharia da Univ. do Porto |
Keywords: Classification and Clustering, Machine Learning and Data Mining
Abstract: In this paper we present an optimization of the Optimum-Path Forest classifier training procedure, which is based on a theoretical relationship between minimum spanning forest and optimum-path forest for a specific path-cost function. Experiments on public datasets have shown that the proposed approach can obtain similar accuracies to the traditional one, but faster for data training.
|
|
14:00-14:30, Paper TuPSBT1.29 | |
Incoherent Dictionary Learning for Sparse Representation |
Lin, Tong | Peking Univ. |
Liu, Shi | Peking Univ. |
Zha, Hongbin | Peking Univ. |
Keywords: Classification and Clustering, Machine Learning and Data Mining, Feature Reduction and Manifold Learning
Abstract: Recent years have witnessed a growing interest in the sparse representation problem. Prior work demonstrated that adaptive dictionary learning techniques can greatly improve the performance of sparse representation approaches. Existing techniques mainly focus on the reconstructive accuracies and the discriminative power of the learned dictionary, whereas the mutual incoherence between any two basis atoms has been rarely studied yet. This paper proposes a novel method by explicitly incorporating a correlation penalty into the dictionary learning model. Experiments show that the proposed method can remarkably reduce the correlation measure of the learned dictionaries, and at the same time achieve higher classification accuracies than state-of-the-art algorithms.
|
|
14:00-14:30, Paper TuPSBT1.30 | |
Similarity Weighted Sparse Representation for Classification |
Guo, Song | Inst. of Information Science, BeijingJiaotongUniversity |
Ruan, Qiuqi | Beijing Jiaotong Univ. |
Miao, Zhenjiang | Inst. of Information Science, Beijing Jiaotong Univ. |
Keywords: Classification and Clustering, Pattern Recognition for Bioinformatics
Abstract: In this paper, we propose a novel sparse representation method for classification called similarity weighted sparse representation (SWSR). The similarity weighted ℓ1-norm minimization, where the weighted matrix is constructed by incorporating the similarity information between the test sample and the entire training samples, is presented as an alternative to ℓ0-norm minimization to seek the optimal sparse representation for the test sample in SWSR. The sparse solution of SWSR encodes more discriminative information than other competing alternatives to ℓ0- norm minimization, so it is more suitable for classification. The experimental results on publicly available face databases demonstrate the efficacy of the proposed method.
|
|
14:00-14:30, Paper TuPSBT1.31 | |
Occluded Human Action Analysis Using Dynamic Manifold Model |
Chen, Li-Chih | Yuan-Ze Univ. |
Hsieh, Jun-Wei | -National Taiwan Ocean Univ. |
Chuang, Chi-Hung | Fo Guang Univ. |
Chen, Duan-Yu | Yuan-Ze Univ. |
Keywords: Pattern Recognition for Surveillance and Security, Motion, Tracking and Video Analysis, Classification and Clustering
Abstract: This paper proposes a novel nonlinear manifold learning method for addressing the ill-posed problem of occluded human action analysis. As we know, a person can perform a broad variety of movements. To capture the multiplicity of a human action, this paper creates a low-dimensional manifold for capturing the intra-path and inter-path contexts of an event. Then, an action path matching scheme can be applied for seeking the best event path for linking the missed information between occluded persons. After that, a recovering scheme is proposed for repairing an occluded object to a complete one. Then, each action can be converted to a series of action primitives through posture analysis. Since occluded objects are handled, there will be many posture-symbol- converting errors. Instead of using a specific symbol, we code a posture using not only its best matched key posture but also its similarities among other key postures. Then, recognition of an action taken from occlude objects can be modeled as a matrix matching problem. With the matrix representation, different actions can be more robustly and effectively matched by comparing their Kullback�Leibler(KL) distances.
|
|
14:00-14:30, Paper TuPSBT1.32 | |
Face Verification Using Temporal Affective Cues |
Ng, Ee Sin | Inst. for Infocomm Res. |
Chia, Yong Sang Alex | Inst. for Infocomm Res. |
Keywords: Pattern Recognition for Bioinformatics, Classification and Clustering, Biometrics
Abstract: It is widely accepted that biometric authentication systems based on human faces are vulnerable to spoofing attacks, in which attackers exploit recaptured photos of legitimate users to gain unauthorized access. To address this vulnerability, numerous approaches which employ complex physics-based models to verify the liveness of presented images have been proposed. While capable of distinguishing between real and recaptured images, a key weakness of these methods is their reliance on an appropriate choice of the model used. Here, we adopt a fundamentally different approach to detect recaptured images. Our method uses randomized temporal affective cues in the form of facial expressions to verify the liveness of users. A quantitative evaluation user study involving 6 users demonstrated the feasibility of our method, and we have also shown that the system is capable of achieving an average classification accuracy of 95.85% for different facial expressions.
|
|
14:00-14:30, Paper TuPSBT1.33 | |
Graph-Based Dimensionality Reduction for KNN-Based Image Annotation |
Liu, Xi | Fujitsu Res. & Development Center Co., LTD |
Liu, Rujie | Fujitsu Res. & Development Center |
Li, Fei | Fujitsu Res. & Development Center Co., Ltd. |
Cao, Qiong | Fujitsu Res. & Development Center Co., Ltd. |
Keywords: Feature Reduction and Manifold Learning, Multimedia Analysis, Indexing and Retrieval
Abstract: KNN-based image annotation method is proved to be very successful. However, it suffers from two issues: (1) high computational cost; (2) the difficulty of finding semantically similar images. In this paper, we propose a graph-based dimensionality reduction method to solve the two problems by adapting the locality sensitive discriminant analysis method [1] to multi-label setting. We first determine relevant and irrelevant images based on label information and construct relevant and irrelevant graphs by focusing on the visually similar relevant and irrelevant images. A linear feature transformation matrix is derived by considering the two graphs. The transformation can map the images to a low-dimensional subspace in which neighborhood relevant images are pulled closer while irrelevant images are pushed away. Thus the new feature after dimensionality reduction is quite fit for KNN-based image annotation. Experiments on the Corel dataset also demonstrate the effectiveness of our dimensionality reduction method for KNN-based image annotation.
|
|
14:00-14:30, Paper TuPSBT1.34 | |
Age Classification in Unconstrained Conditions Using LBP Variants |
Ylioinas, Juha Sakari | Univ. of Oulu, Center for Machine Vision Res. |
Hadid, Abdenour | Univ. of OULU |
Pietik�inen, Matti | Univ. of Oulu |
Keywords: Biometrics, Features and Image Descriptors, Classification and Clustering
Abstract: Automatic age classification from human faces is a challenging task which has recently attained an increasing attention. Most of the proposed approaches have however been mainly concerning controlled settings. In this paper, we propose a novel method for age classification in unconstrained conditions and provide extensive performance evaluation on benchmark datasets with standard protocols, thus allowing a fair comparison and an easy reproduction of the results. Our proposed method is based on a combination of local binary pattern (LBP) variants encoding the structure of elongated facial micro-patterns and their strength. The experimental analysis points out the complexity of the age classification problem under uncontrolled settings. The proposed method provides state-of-the-art performance that can be used as a reference for future investigations.
|
|
14:00-14:30, Paper TuPSBT1.35 | |
Classification Using Graph Partitioning |
Valev, Ventzeslav | Inst. of Mathematics and Informatics, Bulgarian Acad. Sc |
Yanev, Nicola | Univ. of Sofia and Inst. of Mathematics and Informatics |
Keywords: Machine Learning and Data Mining, Statistical, Syntactic and Structural Pattern Recognition, Classification and Clustering
Abstract: This paper explores the classification problem based on parallel feature partitioning. This formulation leads to a new problem in computational geometry. While this new problem appears to be NP-complete, it is shown that the proposed graph theoretical platform makes it semi-tractable, allowing the use of conventional tools for its solution. Here, by conventional, we mean any exact or heuristic algorithm for partitioning a graph into a minimal number of cliques or for finding the clique of maximum cardinality while seeking an efficient heuristic algorithm. An important advantage of this approach is the decomposition of a problem involving l classes into l optimization problems involving a single class. The computational complexity of the method, computational procedures, and classification rules are discussed. A geometrical interpretation of the solution is also given. Using the proposed approach, the geometrical structure of the training set is utilized in the best possible way.
|
|
14:00-14:30, Paper TuPSBT1.36 | |
Metric Learning by Directly Minimizing the K-NN Training Error |
Chernoff, Konstantin | Univ. of Copenhagen |
Loog, Marco | Delft Univ. of Tech. / Univ. of Copenhagen |
Nielsen, Mads | Univ. of Copenhagen |
Keywords: Feature Reduction and Manifold Learning, Classification and Clustering, Machine Learning and Data Mining
Abstract: This paper presents an approach for computing global distance metrics that minimize the k-NN leave-one-out (LOO) error. The approach optimizes an energy function that corresponds to a smoothened version of the k-NN LOO error. The generalization of the proposed approach is further improved by controlling the k parameter through a heuristic. Evaluation of the proposed approach on several public datasets showed that it was able to compete with an established state-of-the art approach.
|
|
14:00-14:30, Paper TuPSBT1.37 | |
Fast-Accurate 3D Face Model Generation Using a Single Video Camera |
Hara, Tomoya | Waseda Univ. |
Kubo, Hiroyuki | Waseda Univ. |
Maejima, Akinobu | Waseda Univ. |
Morishima, Shigeo | Waseda Univ. |
Keywords: Pattern Recognition for Bioinformatics, Pattern Recognition for Surveillance and Security, Motion, Tracking and Video Analysis
Abstract: In this paper, we present a new method to generate a 3D face model, based on both Data-Driven and Structure-from-Motion approach. Considering both 2D frontal face image constraint, 3D geometric constraint, and likelihood constraint, we are able to reconstruct subject�s face model accurately, robustly, and automatically. Using our method, it is possible to create a 3D face model in 5.8 [sec] by only shaking own head freely in front of a single video camera.
|
|
14:00-14:30, Paper TuPSBT1.38 | |
Robust Car License Plate Recognition System Verified with 163, 574 Images Captured in Fields |
Taniyama, Kazuhiko | Mitsubishi Precision Co.,Ltd. |
Hayashi, Kentaro | Mitsubishi Precision Co.,Ltd. |
Keywords: Pattern Recognition for Search, Retrieval and Visualization, Detection, Separation and Segmentation, Image and Video Processing
Abstract: We have been manufacturing our own License Plate Recognition (LPR) systems since 1999 for parking lots in Japan. We have implemented a number of enhancements for our latest LPR system and one of the most important enhancements is for ranges of acceptable license-plate attitude angles. We collected 163,574 actual images so far and we selected 6,648 images randomly from this image database to verify our latest design. The histograms of plate attitude angles are estimated for these 2,836 images by measuring the license-plate shapes in the images and analyzed statistically. The analysis results for these actual 2,836 images prove that our latest LPR system accepts the attitude angles within its product specifications with enough margins.
|
|
14:00-14:30, Paper TuPSBT1.39 | |
Searching for the Optimal Ordering of Classes in Rule Induction |
Ata, Sezin | Işık Univ. |
Yildiz, Olcay Taner | Isik Univ. |
Keywords: Machine Learning and Data Mining, Classification and Clustering, Statistical, Syntactic and Structural Pattern Recognition
Abstract: Rule induction algorithms such as Ripper, solve a K > 2 class problem by converting it into a sequence of K-1 two-class problems. As a usual heuristic, the classes are fed into the algorithm in the order of increasing prior probabilities. In this paper, we propose two algorithms to improve this heuristic. The first algorithm starts with the ordering the heuristic provides and searches for better orderings by swapping consecutive classes. The second algorithm transforms the ordering search problem into an optimization problem and uses the solution of the optimization problem to extract the optimal ordering. We compared our algorithms with the original Ripper on 8 datasets from UCI repository [2]. Simulation results show that our algorithms produce rulesets that are significantly better than those produced by Ripper proper.
|
|
14:00-14:30, Paper TuPSBT1.40 | |
Inverse Biometrics: A Case Study in Hand Geometry Authentication |
Gomez-Barrero, Marta | Univ. Autonoma de Madrid |
Galbally, Javier | Univ. Autonoma de Madrid |
Morales, Aythami | Univ. de las Palmas de Gran Canaria |
Ferrer, M.A. | Univ. Las Palmas de Gran Canaria |
Fierrez, Julian | Univ. Autonoma de Madrid |
Ortega-Garcia, Javier | Univ. Autonoma de Madrid |
Keywords: Biometrics, Pattern Recognition for Surveillance and Security, Image and Video Processing
Abstract: Recently, a considerable amount of research has been focused on inverse biometrics, that is, regenerat- ing the original biometric sample from its template. In this work, the first reconstruction approach to recover hand geometry samples from their feature vectors is proposed. Experiments are carried out on the publicly available GPDS Hand DB, where the method has shown a remarkable performance, after reconstructing a very high percentage of the hands included in the dataset. Furthermore, the proposed technique is general, being able to successfully reproduce the original hand shape sample regardless of the information and format of the template used.
|
|
14:00-14:30, Paper TuPSBT1.41 | |
An Alternative to IDF: Effective Scoring for Accurate Image Retrieval with Non-Parametric Density Ratio Estimation |
Uchida, Yusuke | KDDI R&D Lab. Inc. |
Takagi, Koichi | KDDI R&D Lab. Inc. |
Sakazawa, Shigeyuki | KDDI R&D Lab. Inc. |
Keywords: Pattern Recognition for Search, Retrieval and Visualization, 2D/3D Object Detection and Recognition
Abstract: In this paper, we propose a new scoring method for local feature-based image retrieval. The proposed score is based on the ratio of the probability density function of an object model to that of background model, which is efficiently calculated via nearest neighbor density estimation. The proposed method has the following desirable properties: (1) a sound theoretical basis, (2) effectiveness than IDF scoring, (3) applicability not only to quantized descriptors but also to raw descriptors, and (4) ease and efficiency of calculation and updating. We show the effectiveness of the proposed method empirically by applying it to a bag-of-visual words-based framework and a k-NN voting framework.
|
|
14:00-14:30, Paper TuPSBT1.42 | |
Compound Color Recognition Via Image Analysis on High-Throughput Compound Libraries |
von Korff, Modest | Actelion Pharmaceuticals Ltd. |
Freyss, Joel | Actelion Pharmaceuticals Ltd. |
Klenk, Axel | Actelion Pharmaceuticals Ltd. |
Silva, Joao | Actelion Pharmaceuticals Ltd. |
Bourquin, Geoffroy | Actelion Pharmaceuticals Ltd. |
Peter, Oliver | Actelion Pharmaceuticals Ltd. |
Sander, Thomas | Actelion Pharmaceuticals Ltd. |
Keywords: Pattern Recognition for Bioinformatics, Segmentation, Color and Texture
Abstract: A new machine vision method is presented to detect micro tubes with colored solutions or undissolved compounds in solvents on screening plates used in drug discovery. The method presented herein takes an image of a 96 tubes micro tube rack as input. After applying edge detection on the input image, the circles that characterize the borders of the micro tubes are detected via shape matching. A gradient search is used to extract the colors from the micro tubes. Automatic calibration is applied to adapt to changing light conditions. Experiments with more than two thousand micro tube racks (approx. 170,000 compounds) compiled over three years showed that the method is fast and robust.
|
|
14:00-14:30, Paper TuPSBT1.43 | |
Face Recognition Using Semi-Supervised Spectral Feature Selection |
Zhang, Zhihong | Univ. of York |
Hancock, Edwin | Univ. of York |
Keywords: Machine Learning and Data Mining, Feature Reduction and Manifold Learning, Classification and Clustering
Abstract: Semi-supervised learning is important when labeled data are scarce. In this paper, we develop a novel semi-supervised spectral feature selection technique using label regression and by using ell_{1}-norm regularized models for subset selection. Specifically, we propose a new two-step spectral regression technique for semi-supervised feature selection. In the first step, we use label propagation and label regression to transform the data into a lower-dimensional space so as to improve class separation. Second, we use ell_{1}-norm regularization to select the features that best align with the lower-dimensional data. Using ell_{1}-norm regularization, we cast feature discriminant analysis into a regression framework which accommodates the correlations among features. As a result, we can evaluate joint feature combinations, rather than being confined to consider them individually. Experimental results demonstrate the effectiveness of our feature selection method on standard face data-sets.
|
|
14:00-14:30, Paper TuPSBT1.44 | |
Interactive Graph Matching by Means of Imposing the Pairwise Costs |
Serratosa, Francesc | Univ. Rovira i Virgili |
Cort�s, Xavier | Univ. Rovira i Virgili |
Sol�-Ribalta, Albert | Univ. Rovira i Virgili |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Classification and Clustering, Machine Learning and Data Mining
Abstract: We present a method to perform graph matching in which the human can interact and impose part of the graph labelling. Humans are very good at finding the correspondences between parts of two images but finding these correspondences is one of the most difficult tasks in pattern recognition. Through simple actions such as impose a node labelling or consider a node labelling is not correct; the user helps the automatic graph-matching algorithm to decide a labelling that is closer to the one that the human desires. This interaction is done through the modification of the initial pairwise costs. The method is independent on the graph-matching algorithm. Practical evaluation, in which the Graduated Assignment has been used, shows that with few interactions, the algorithm arrives at the ideal labelling.
|
|
14:00-14:30, Paper TuPSBT1.45 | |
A New Feature and Associated Optimal Spatial Filter for EEG Signal Classification: Waveform Length |
Lotte, Fabien | Inria Bordeaux Sud-Ouest |
Keywords: Pattern Recognition for Bioinformatics, Enhancement, Restoration and Filtering, Classification and Clustering
Abstract: In this paper, we introduce Waveform Length (WL), a new feature for ElectroEncephaloGraphy (EEG) signal classification which measures the signal complexity. We also propose the Waveformlength Optimal Spatial Filter (WOSF), an optimal spatial filter to classify EEG signals based on WL features. Evaluations on 15 subjects suggested that WOSF with WL features provide performances that are competitive with that of Common Spatial Patterns (CSP) with Band Power (BP) features, CSP being the optimal spatial filter for BP features. More interestingly, our results suggested that combining WOSF with CSP features leads to classification performances that are significantly better than that of CSP alone (80% versus 77% average accuracy respectively).
|
|
14:00-14:30, Paper TuPSBT1.46 | |
Unsupervised Online Learning Trajectory Analysis Based on Weighted Directed Graph |
Shen, Yuan | Beijing Jiaotong Univ. |
Miao, Zhenjiang | Inst. of Information Science, Beijing Jiaotong Univ. |
Zhang, Jian | Univ. of Tech. Sydney |
Keywords: Pattern Recognition for Surveillance and Security, Gesture and Behavior Analysis, Classification and Clustering
Abstract: In this paper, we propose a novel unsupervised online learning trajectory analysis method based on weighted directed graph. Each trajectory can be represented as a sequence of key points. In the training stage, unsupervised expectation-maximization algorithm (EM) is applied for training data to cluster key points. Each class is a Gaussian distribution. It is considered as a node of the graph. According to the classification of key points, we can build a weighted directed graph to represent the trajectory network in the scene. Each path is a category of trajectories. In the test stage, we adopt online EM algorithm to classify trajectories and update the graph. In the experiments, we test our approach and obtain a good performance compared with state-of-the-art approaches.
|
|
14:00-14:30, Paper TuPSBT1.47 | |
A One-Per-Class Reconstruction Rule for Class Imbalance Learning |
D'Ambrosio, Roberto | Univ. Campus Bio-Medico of Rome |
Iannello, Giulio | Univ. Campus Bio-Medico di Roma |
Soda, Paolo | Univ. Campus Bio-Medico di Roma |
Keywords: Machine Learning and Data Mining, Classification and Clustering
Abstract: Class imbalance limits the performance of most learning algorithms since they cannot cope with large differences between the number of samples in each class, resulting in a low predictive accuracy over the minority ones. Several algorithms achieving more balanced performance in case of binary learning have been proposed, while few researches exists in case of multiclass learning. This paper proposes a new reconstruction rule for the One-per-Class (OpC) decomposition method that, distinguishing between safe and dangerous classifications using sample classification reliability, compensates class imbalance in multiclass recognition problems and reduces effects due to the skewness between classes. The approach has been successfully tested on five datasets using three different classification architectures, and it favourably compares with results provided both by a multiclass classifier and by a popular OpC reconstruction rule.
|