|
ThPSAT1 |
Main Hall |
Poster Shotgun (11): PR |
Regular Session |
|
08:30-09:00, Paper ThPSAT1.1 | |
Object Clique Representation for Scenes Classification |
Chen, Jingjing | Tianjin Univ. |
Cao, Xiaochun | Tianjin Univ. |
Zhang, Bao | Tianjin Univ. |
Keywords: Classification and Clustering, Scene Understanding
Abstract: High-level visual recognition such as scene classification is a challenging task in computer vision. In this paper, we propose an image descriptor based on semantic cliques obtained by high-order pure dependence, and the image is represented by a vector whose element denotes the probability of containing each object cliques. Compared with using single objects as attributes, such representation carries corresponding semantic information, making it more suitable for high-level visual recognition tasks. The experiments show that our object cliques as attributes for scene representation improves the accuracy of image classification.
|
|
08:30-09:00, Paper ThPSAT1.2 | |
Towards Breast Ultrasound Image Segmentation Using Multi-Resolution Pixel Descriptors |
Rodrigues, Rafael | U.B.I. - Univ. da Beira Interior |
Pinheiro, Antonio | U.B.I. - Univ. da Beira Interior |
Braz, Rui | U.B.I. - Univ. da Beira Interior |
Pereira, Manuela | U.B.I. - Univ. da Beira Interior |
Moutinho, Jose | U.B.I. - Univ. da Beira Interior |
Keywords: Pattern Recognition for Bioinformatics, Segmentation, Color and Texture, Medical Image Analysis and Registration
Abstract: Breast ultrasound images are an important diagnostic factor for breast cancer detection. However, ultrasound imaging is intrinsically degraded by noise, resulting in a difficult detection of masses or nodules, and, most importantly, the evaluation of their size and shape. Computer-aided diagnosis figures as a major help factor, when it comes to analyzing this type of medical imaging. A fully automated and computationally efficient method for breast ultrasound segmentation is proposed. The algorithm classifies the images, with Support Vector Machines and Discriminant Analysis classifiers, based on a pixel descriptor formed with the information from anisotropic diffusion, band-pass filtering and scale-space curvature. The final segmentation results after the application of a set of heuristic rules for the selection of the classifiers' result, based on the ultrasound image characteristics. The final segmentation results yielded good overall accuracy, precision and also recall rates.
|
|
08:30-09:00, Paper ThPSAT1.3 | |
Hand-Dorsa Vein Recognition Based on Multi-Level Keypoint Detection and Local Feature Matching |
Tang, Yinhang | Beihang Univ. |
Huang, Di | Beihang Univ. |
Wang, Yunhong | Beihang Univ. |
Keywords: Biometrics
Abstract: As a new biometric for person authentication, hand-dorsa vein has attracted increasing attention in recent years. This paper proposes a novel approach for hand-dorsa vein recognition, which makes use of multi-level keypoint detection and SIFT feature based local matching. In order to overcome the difficulty in finding local features on NIR images of hand dorsa, a multi-level keypoint detection approach, composed by Harris-Laplace and Hessian-Laplace detectors, is designed to localize enough keypoints so that more discriminative information can be highlighted. Then SIFT based local matching efficiently associates these keypoints between hand dorsa of the same individual. The experimental results achieved on the NCUT database clearly indicate the effectiveness of the proposed method for hand-dorsa vein recognition.
|
|
08:30-09:00, Paper ThPSAT1.4 | |
Jensen Divergence Based SPD Matrix Means and Applications |
Nielsen, Frank | Sony Computer Science Lab. Inc |
Liu, Meizhu | Siemens Corp. Res. |
Ye, Xiaojing | Georgia Tech. |
Vemuri, Baba | Univ. of Florida |
Keywords: Classification and Clustering, Image and Video Processing
Abstract: Finding mean of matrices becomes increasingly important in modern signal processing problems that involve matrix-valued images. In this paper, we define the mean for a set of symmetric positive definite (SPD) matrices based on information-theoretic divergences as the unique minimizer of the averaged divergences, and compare it with the means computed using the Riemannian and Log-Euclidean metrics. For the class of divergences induced by the convexity gap of a matrix functional, we present a fast iterative concave-convex optimization scheme with guaranteed convergence to efficiently approximate those divergence-based means.
|
|
08:30-09:00, Paper ThPSAT1.5 | |
Keyword Clustering for Automatic Categorization |
Zhao, Qinpei | School of Computing, Univ. of Eastern Finland |
Rezaei, Mohammad | School of Computing, Univ. of Eastern Finland |
Chen, Hao | School of Computing, Univ. of Eastern Finland |
Fr�nti, Pasi | Univ. of Eastern Finland |
Keywords: Classification and Clustering, Document Understanding, Machine Learning and Data Mining
Abstract: Processing short texts is becoming a trend in information retrieval. Since the text has rarely external information, it is more challenging than document. In this paper, keyword clustering is studied for automatic categorization. To obtain semantic similarity of the keywords, a broad-coverage lexical resource WordNet is employed. We introduce a semantic hierarchical clustering. For automatic keyword categorization, a validity index for determining the number of clusters is proposed. The minimum value of the index indicates the potentially appropriate categorization. We show the result in experiments, which indicates the index is effective.
|
|
08:30-09:00, Paper ThPSAT1.6 | |
Hybdrid Content Based Image Retrieval Combining Multi-Objective Interactive Genetic Algorithm and SVM |
Pighetti, Romaric | Lab. I3S, UMR UNS-CNRS 7271 |
Pallez, Denis | Lab. I3S, UMR UNS-CNRS 7271 |
Precioso, Frederic | Lab. I3S, UMR UNS-CNRS, 7271 |
Keywords: Machine Learning and Data Mining, Classification and Clustering, Multimedia Analysis, Indexing and Retrieval
Abstract: The amount of images contained in repositories or available on Internet has exploded over the last years. In order to retrieve efficiently one or several images in a database, the development of Content-Based Image Retrieval (CBIR) systems has become an intensively active research area. However, most proposed systems are keyword-based and few imply the end-user during the search (through relevance feedback). Visual low-level descriptors are then substituted to keywords but there is a gap between visual description and user expectations. We propose a new framework which combines a multi-objective interactive genetic algorithm, allowing a trade-off between image features and user evaluations, and a support vector machine to learn the user relevance feedback. We test our system on SIMPLIcity database, commonly used in the literature to evaluate CBIR systems using a genetic algorithm, and it outperforms the recent frameworks.
|
|
08:30-09:00, Paper ThPSAT1.7 | |
Statistical Modeling and Signal Selection in Multivariate Time Series Pattern Classification |
Liu, Ruoqian | Univ. of Michigan - Dearborn |
Xu, Shen | Univ. of Michigan - Dearborn |
Fang, Chen | Univ. of Michigan - Dearborn |
Liu, Yung-wen | Univ. of Michigan - Dearborn |
Kochhar, Dev | Ford Motor Company |
Murphey, Yi | Univ. of Michigan-Dearborn |
Keywords: Feature Reduction and Manifold Learning, Statistical, Syntactic and Structural Pattern Recognition, Pattern Recognition for Bioinformatics
Abstract: This paper presents an algorithm for selecting a compact subset of relevant signals for pattern classification problems involving multivariate time series (MTS) data. The algorithm uses a statistical causality modeling method to select relevant signals, and a moving average correlation analysis method to remove redundant signals. The MTS signal selection algorithm was evaluated through a case study: driver wellness classification. From a set of 20 time series signals, the signal selection algorithm selected a subset of 9 signals that are independent and most relevant to the pattern class. We trained a driver wellness classification system using Random Forest (RF) with the input of 20 original signals, and another system with the selected 9 signals. The experiment results show that the system used the selected 9 signals performed better than the system used the original set of 20 signals consistently over different sizes of RF.
|
|
08:30-09:00, Paper ThPSAT1.8 | |
Multimodal Biometric Authentication Based on Iris Pattern and Pupil Light Reflex |
Yano, Vitor | Unicamp |
Zimmer, Alessandro | UFPR |
Ling, Lee Luan | Unicamp |
Keywords: Biometrics, Image and Video Processing, Detection, Separation and Segmentation
Abstract: Biometrics-based authentication is a method of personal identification that has some advantages over the password and object-based ones, mainly for the user, who doesn't need to carry or memorize anything. However, this kind of identification is also subject to problems. Besides the technology-related possibilities of fraud, such as system invasion, database corruption or algorithm injection, some of the common used biometric features can be faked. Furthermore, most cases of false rejection are related to the quality of the acquired sample. This paper proposes a multimodal biometric authentication method which incorporates the use of dynamic features of the human reflex and the iris pattern recognition for a better performance. A prototype system has been implemented and tested with 59 volunteers. Experimental results presented an EER of 2.44%.
|
|
08:30-09:00, Paper ThPSAT1.9 | |
A Study on Semi-Supervised Dissimilarity Representation |
Dinh, Viet Cuong | Delft Univ. of Tech. |
Duin, Robert | TU Delft |
Loog, Marco | Delft Univ. of Tech. / Univ. of Copenhagen |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Machine Learning and Data Mining, Classification and Clustering
Abstract: In the dissimilarity representation approach, objects are represented by their dissimilarities with respect to a representation set, rather than by features. Up to now, the representation or prototype set has usually been selected from the training data, limiting the different aspects that can be captured, especially when the training data set is small. This paper studies the performance change if the object�s representation is extended by including also test data into the representation set in a semi-supervised setting. Experiments on a set of standard data show that the semi-supervised setting can substantially improve the performance of the dissimilarity based representation especially for the small sample size problem.
|
|
08:30-09:00, Paper ThPSAT1.10 | |
Multi-Class Ada-Boost Classification of Object Poses through Visual and Infrared Image Information Fusion |
Changrampadi, Mohamed | Chalmers Univ. of Tech. |
Yun, Yixiao | Chalmers Univ. of Tech. |
Gu, Irene Yu-Hua | Chalmers Univ. of Tech. |
Keywords: Pattern Recognition for Search, Retrieval and Visualization, Classification and Clustering, Machine Learning and Data Mining
Abstract: This paper presents a novel method for pose classification using fusion of visual and thermal infrared(IR) images. We propose a novel tree structure multi-class classification scheme with visual and IR sub-classifiers. These sub-classifiers are different from the conventional one-against-all or one-against-one strategies, where we handle the multi-class problem directly. We propose to use an accuracy score for the fusion of visual and IR sub-classifiers. In addition, we propose to use the original Haar features plus an extra one, and a multi-threshold weak learner to obtain weak hypothesis. The experimental results on a visual and IR image dataset containing 3018 face images in three poses show that the proposed classifier achieves high classification rate of 99.50% on the test set. Comparisons are made to a fused one-vs-all method, a classifier with visual band only, and a classifier with IR band only. Results provide further support to the proposed method.
|
|
08:30-09:00, Paper ThPSAT1.11 | |
Cluster-Classification Bayesian Networks for Head Pose Estimation |
Kafai, Mehran | Univ. of California, Riverside |
Bhanu, Bir | Univ. of California |
An, Le | Univ. of California, Riverside |
Keywords: Classification and Clustering, Biometrics, Machine Learning and Data Mining
Abstract: Head pose estimation is critical in many applications such as face recognition and human-computer interaction. Various classifiers such as LDA, SVM, or nearest neighbor are widely used for this purpose; however, the recognition rates are limited due to the limited discriminative power of these classifiers for discretized pose estimation. In this paper, we propose a head pose estimation method using a Cluster-Classification Bayesian Network (CCBN), specifically designed for classification after clustering. A pose layout is defined where similar poses are assigned to the same block. This increases the discriminative power within the same block when similar yet different poses are present. We achieve the highest recognition accuracy on two public databases (CAS-PEAL and FEI) compared to the state-of-the-art methods.
|
|
08:30-09:00, Paper ThPSAT1.12 | |
Improving Cross-Validation Based Classifier Selection Using Meta-Learning |
Krijthe, Jesse Hendrik | Delft Univ. of Tech. |
Ho, Tin Kam | Bell Lab. Alcatel-Lucent |
Loog, Marco | Delft Univ. of Tech. / Univ. of Copenhagen |
Keywords: Classification and Clustering, Statistical, Syntactic and Structural Pattern Recognition
Abstract: In this paper we compare classifier selection using cross-validation with meta-learning, using as meta-features both the cross-validation errors and other measures characterizing the data. Through simulation experiments we demonstrate situations where meta-learning offers better classifier selections than ordinary cross-validation. The results provide some evidence to support meta-learning not just as a more time efficient classifier selection technique than cross-validation, but potentially as more accurate. It also provides support for the usefulness of data complexity estimates as meta-features for classifier selection.
|
|
08:30-09:00, Paper ThPSAT1.13 | |
Jensen-Shannon Graph Kernel Using Information Functionals |
Bai, Lu | the Univ. of York |
Hancock, Edwin | Univ. of York |
Ren, Peng | China Univ. of Petroleum (Huadong) |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Machine Learning and Data Mining, Classification and Clustering
Abstract: In recent work we have shown how to use the von Neumann entropy to construct a Jensen-Shannon kernel on graphs. The kernel is defined as the difference in entropies between a product graph and the separate graphs being compared. To develop this graph kernel further, in this paper we explore how to render the computation of the Jensen-Shannon kernel more efficient by using the information functionals defined by Dehmer to compute the required entropies. We illustrate how the resulting Jensen-Shannon graph kernels can be used for the purposes of graph clustering. Experimental results reveal that the methods gives good classification performance on graphs extracted from an object recognition dataset and several bioinformatics datasets.
|
|
08:30-09:00, Paper ThPSAT1.14 | |
Graph Clustering Using Graph Entropy Complexity Traces |
Bai, Lu | the Univ. of York |
Hancock, Edwin | Univ. of York |
Ren, Peng | China Univ. of Petroleum (Huadong) |
Han, Lin | THE Univ. OF YORK |
Keywords: Machine Learning and Data Mining, Statistical, Syntactic and Structural Pattern Recognition, Pattern Recognition for Bioinformatics
Abstract: In this paper, we aim to present a principled approach to the problem of depth-based complexity characterisation of graphs. Our idea is to decompose graphs into substructures of increasing size, and then to measure the complexity of these substructures using Shannon entropy or von-Neumann entropy. We commence by identifying the dominant vertex in a graph. From the dominant vertex, we construct subgraphs of increasing K layers, so-called semidiameter subgraphs. We then measure how the entropy varies with increasing K layer semidiameter subgraphs. We construct a vector of subgraph entropies for each graph, a depth-based complexity trace, and then perform graph clustering in the principal components space of the vectors. We explore our approach on both synthetic data and datasets from the domain of bioinformatics.
|
|
08:30-09:00, Paper ThPSAT1.15 | |
Face Recognition in Multi-Camera Surveillance Videos |
An, Le | Univ. of California, Riverside |
Bhanu, Bir | Univ. of California |
Yang, Songfan | Univ. of California, Riverside |
Keywords: Pattern Recognition for Surveillance and Security, Biometrics, Classification and Clustering
Abstract: Recognizing faces in surveillance videos becomes difficult due to the poor quality of the probe data in terms of resolution, noise, blurriness, and varying lighting conditions. In addition, the poses of probe data are usually not frontal view, contrary to the standard format of the gallery data. The discrepancy between the two types of the data makes the existing recognition algorithm less accurate in real-world data. In this paper, we propose a multi-camera video based face recognition framework using a novel image representation called Unified Face Image (UFI), which is synthesized from multiple camera feeds. Within a temporal window the probe frames from different cameras are warped towards a template frontal face and then averaged. The generated UFI is a frontal view of the subject that incorporates information from different cameras. We use SIFT flow as a high level alignment tool to warp the faces. Experimental results show that by using the fused face, the recognition performance is better than the result of any single camera. The proposed framework can be adapted to any multi-camera video based recognition method using any feature descriptors or classifiers.
|
|
08:30-09:00, Paper ThPSAT1.16 | |
Digital Privacy: Replacing Pedestrians from Google Street View Images |
Nodari, Angelo | Univ. degli studi dell'Insubria |
Vanetti, Marco | Univ. degli Studi dell'Insubria |
Gallo, Ignazio | Univ. degli Studi dell'Insubria |
Keywords: Pattern Recognition for Surveillance and Security, Inpainting and Superimposing, Detection, Separation and Segmentation
Abstract: Given the lack of modern techniques to ensure the digital privacy of individuals, we want to pave the way for a new approach to make pedestrians in cityscape images anonymous. To address these concerns, we propose an automated method to replace any unknown pedestrian with another one which is extracted from a controlled and authorized dataset. The techniques used up to now to make people anonymous are based mainly on the blurring of people�s faces, but even so it is possible to trace the identity of the subject starting from his clothing, personal items, hairstyle, the place and time where the photo was taken. The proposed method aims to make the pedestrians completely anonymous, and consists of four phases: firstly we identify the area where the pedestrian is located, we separate the pedestrian from the background, we select the most similar pedestrian from a controlled dataset and subsequently we substitute it. Our case study is Google Street View because it is one of the online services which suffers most from this kind of privacy issues. The experimental results show how this technique can overcome the problems of digital privacy with promising results.
|
|
08:30-09:00, Paper ThPSAT1.17 | |
Thresholding-Based Segmentation Revisited Using Mixtures of Generalized Gaussian Distributions |
Boulmerka, Aissa | �cole Nationale Sup�rieure d'informatique |
Allili, Mohand Said | Univ. du Qu�bec en Outaouais |
Keywords: Classification and Clustering, Segmentation, Color and Texture, Statistical, Syntactic and Structural Pattern Recognition
Abstract: This paper presents a new approach to image-thresholding-based segmentation. It considerably improves existing methods by efficiently modeling non-Gaussian and multi-modal class-conditional distributions. The proposed approach seamlessly: 1) extends the Otsu's method to arbitrary numbers of thresholds and 2) extends the Kittler and Illingworth minimum error thresholding to non-Gaussian and multi-modal class-conditional data. We use the recently-proposed mixture of generalized Gaussian distributions (MoGG) modeling, which enables to efficiently represent heavy-tailed data, as well as multi-modal histograms with flat and sharply-shaped peaks. Experiments performed on synthetic data and real-world image segmentation show the performance of the proposed approach with comparison to recent state-of-the-art techniques.
|
|
08:30-09:00, Paper ThPSAT1.18 | |
Face Analysis of Aggressive Moods in Automobile Driving Using Mutual Subspace Method |
Moriyama, Tsuyoshi | Tokyo Pol. Univ. |
Khiat, Abdelaziz | Nissan Motor Co., Ltd. |
Shimomura, Noriko | Nissan Motor co., ltd. |
Keywords: Gesture and Behavior Analysis, Pattern Recognition for Bioinformatics, Feature Reduction and Manifold Learning
Abstract: Aggressive affections of automobile drivers such as irritation often cause unpleasant experiences and ultimately road rage. Detecting their cues from drivers' behaviors and obviating undesirable consequences is the most important role of automobile navigation for future safe driving. Facial expressions have been found to be a useful indicator of the driver's affection due to the robustness in monitoring drivers compared with other sensors. Affection consists of two kinds of factors: emotion (impulsive and strong) and mood (long lasting and subtle), where mood biases what kind of emotions to come up. Although moods dominate emotions, conventional approach in facial expression analysis has focused on emotion rather than mood in this context. The technical difficulty in analyzing moods is that there is no neutral expression that has been used as the firm reference for classifying facial expressions because the neutral is the mood itself and varies over time. The proposed method parameterizes appearance changes of face image sequence using mutual subspace method, and estimates the levels of aggressive mood, i.e., irritation and tense. Experimental results that used simulated facial expressions gave the optimal configuration of the proposed method.
|
|
08:30-09:00, Paper ThPSAT1.19 | |
Automated Apple Stem End and Calyx Detection Using Evolution-COnstructed Features |
Lillywhite, Kirt | Brigham Young Univ. |
Tippetts, Beau | Brigham Young Univ. |
Lee, Dah-Jye | Brigham Young Univ. |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Features and Image Descriptors
Abstract: A majority of consumers list flavor, unbruised and unblemished, and crispness as being the most important characteristics of apples. There is a need for reliable automatic inspection processes to identify bruises and blemishes allowing apples to be sent to the fresh fruit market. Several references state that distinguishing stem end and calyx from true defects is the main challenge for automated apple sorting systems. This research presents the application of the general object recognition algorithm, Evolutionary COnstructed features, to the problem of correctly distinguishing bruises and blemishes from the stem end and calyx of apples. The use of this algorithm demonstrates the feasibility of using machine vision technology with off-the-shelf optical and electronics components to detect true bruises and blemishes on apples with 94% accuracy.
|
|
08:30-09:00, Paper ThPSAT1.20 | |
Top-K Correlated Subgraph Query for Data Streams |
Pan, Shirui | Univ. of Tech. Sydney |
Zhu, Xingquan | Florida Atlantic Univ. |
Fang, Meng | Univ. of Tech. Sydney |
Keywords: Machine Learning and Data Mining, Statistical, Syntactic and Structural Pattern Recognition
Abstract: Given a query graph q, correlated subgraph query intends to find graph structures which are mostly correlated to q. This problem is fundamental for many pattern recognition applications involving structured data like graphs. Current available studies on correlation mining from graph data are all designed for static datasets. However, in real-life applications, data may arrive continuously in a streaming fashion with high speed. In this paper we investigate the problem of top-k correlated subgraphs query over stream. By employing Hoeffding bound into the candidate discovery process and carefully maintaining a candidate list over stream, a novel algorithm, Hoe-PG, is proposed to incrementally identify the top-k correlated subgraphs in a sliding window over stream. Experiments show that the proposed method is several times more efficient than its peer with respect to the runtime and the memory consumption, and is able to maintain high precision and recall for stream-based graph query.
|
|
08:30-09:00, Paper ThPSAT1.21 | |
Timed and Probabilistic Automata for Automatic Animal Call Recognition |
Duan, Shufei | QUEENSLAND Univ. OF Tech. |
Zhang, Jinglan | QUEENSLAND Univ. OF Tech. |
Roe, Paul | QUEENSLAND Univ. OF Tech. |
Towsey, Michael | QUEENSLAND Univ. OF Tech. |
Buckingham, Lawrence | QUEENSLAND Univ. OF Tech. |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Machine Learning and Data Mining, Classification and Clustering
Abstract: Automatic Call Recognition is vital for environmental monitoring. Patten recognition has been applied in automatic species recognition for years. However, few studies have applied formal syntactic methods to species call structure analysis. This paper introduces a novel method to adopt timed and probabilistic automata in automatic species recognition based upon acoustic components as the primitives. We demonstrate this through one kind of birds in Australia: Eastern Yellow Robin.
|
|
08:30-09:00, Paper ThPSAT1.22 | |
Automatic Fuzzy Clustering Based on Mistake Analysis |
Ben, Shenglan | NanjingUniversity of Science and Tech. |
Jin, Zhong | Nanjing Univ. of Science and Tech. |
Yang, Jingyu | Nanjing Univ. of Science and Tech. |
Keywords: Classification and Clustering
Abstract: This paper presents a robust fuzzy clustering algorithm which can perform clustering without pre-assigning the number of clusters and is not sensitive to the initialization of cluster centers. This is achieved by iteratively splitting and merging operations under the guidance of mistake measurements. In every step of the iteration, we first split the cluster containing data points belonging to different classes, and then merge the wrongly divided cluster pair. A validity index is proposed based on the two mistake measurements to determine the termination of the clustering process. Experimental results confirm the effectiveness and robustness of the proposed clustering algorithm.
|
|
08:30-09:00, Paper ThPSAT1.23 | |
Utilizing Co-Occurrence Patterns for Semantic Concept Detection in Images |
Feng, Linan | Univ. of California Riverside |
Bhanu, Bir | Univ. of California |
Keywords: Pattern Recognition for Search, Retrieval and Visualization, Image and Video Understanding, Detection, Separation and Segmentation
Abstract: Semantic concept detection is an important open problem in concept-based image understanding. In this paper, we develop a method inspired by social network analysis to solve the semantic concept detection problem. The novel idea proposed is the detection and utilization of concept co-occurrence patterns as contextual clues for improving individual concept detection. We detect the patterns as hierarchical communities by graph modularity optimization in a network with nodes and edges representing individual concepts and co-occurrence relationships. We evaluate the effect of detected co-occurrence patterns in the application scenario of automatic image annotation. Experimental results on SUN�09 and OSR datasets demonstrate our approach achieves significant improvements over popular baselines.
|
|
08:30-09:00, Paper ThPSAT1.24 | |
Hypergraph Matching Based on Marginalized Constrained Compatibility |
Su, Jiang | Univ. of electronic science and Tech. of China |
Le, Dong | Univ. of electronic science and Tech. of China |
Ren, Peng | China Univ. of Petroleum (Huadong) |
Hancock, Edwin | Univ. of York |
Keywords: Statistical, Syntactic and Structural Pattern Recognition
Abstract: We aim to match two hypergraphs via pairwise characterization of multiple relationships. To this end, we introduce a technique referred to as Marginalized Constrained Compatibility Estimation (MCCE), which transforms the compatibility tensor representing hyperedge similarities into a compatibility matrix representing edge similarities. We then cluster graph vertices associated with the compatibility matrix and extract its dominant set as the optimal matches. Our MCCE-based method overcomes the information loss arising in arithmetic average, which is commonly used for marginalization in the hypergraph matching literature. Experiments demonstrate the effectiveness of our method.
|
|
08:30-09:00, Paper ThPSAT1.25 | |
Bias Analyses of Spontaneous Facial Expression Database |
Wang, Zhaoyu | Univ. ofScience and Tech. of China, Hefei, Anhui,P.R.C |
Wang, Shangfei | Univ. of Science and Tech. of China |
Zhu, Yachen | Univ. of Science and Tech. of China |
Ji, Qiang | RPI |
Keywords: Gesture and Behavior Analysis
Abstract: In this paper, cross-corpora evaluations are used to analyze the bias of spontaneous facial expression databases. Local binary pattern, Gabor, eigenface and fisherface features are extracted and applied to the four spontaneous expression databases: USTC-NVIE, VAM, Belfast Naturalistic and SEMAINE to recognize arousal (high/low) and valance (positive/negative) respectively. Experimental results indicate that there exists bias a- mong different spontaneous expression databases. The emotion-induction methods, the variety of subjects and the quantity of raters may have caused such a bias.
|
|
08:30-09:00, Paper ThPSAT1.26 | |
Multiple HOG Templates for Gait Recognition |
Liu, Yushu | Fudan Univ. |
Zhang, Junping | Fudan Univ. |
Wang, Chen | Fudan Univ. |
Wang, Liang | Inst. of Automation, Chinese Acad. of Sciences |
Keywords: Biometrics, Features and Image Descriptors, Image and Video Processing
Abstract: In gait recognition field, template-based approaches such as Gait Energy Image (GEI) and Chrono-Gait Image (CGI) can achieve good recognition performance with low computational cost. Meanwhile, CGI can preserve temporal information better than GEI. However, they pay less attention to the local shape features. To preserve temporal information and generate more abundant local shape features, we generate multiple HOG templates by extracting Histogram of Oriented Gradients (HOG) of GEI and CGI templates. Experiments show that compared with several published approaches, our proposed multiple HOG templates achieve better performance for gait recognition.
|
|
08:30-09:00, Paper ThPSAT1.27 | |
Distance Matrices As Invariant Features for Classifying MoCap Data |
Vieira, Antonio Wilson | Univ. Federal de Minas Gerais |
Lewiner, Thomas | PUC-Rio |
Schwartz, William | Federal Univ. of Minas Gerais |
Campos, Mario Montenegro Campos | Univ. Federal de Minas Gerais |
Keywords: Gesture and Behavior Analysis, Classification and Clustering, Statistical, Syntactic and Structural Pattern Recognition
Abstract: This work introduces a new representation for Motion Capture data (MoCap) that is invariant under rigid transformation and robust for classification and annotation of MoCap data. This representation relies on distance matrices that fully characterise the class of identical postures up to the body position or orientation. This high dimensional feature descriptor is tailored using PCA and incorporated into an action graph based classification scheme. Classification experiments on publicly available data show the accuracy and robustness of the proposed MoCap representation.
|
|
08:30-09:00, Paper ThPSAT1.28 | |
Inference Bag of Features Using Sparse Coding for Image Classification |
Peng, Yu | Univ. of Newcastle, Australia |
Min, Xu | Univ. of Tech. Sydney |
Jin, Jesse | Univ. of Newcastle, Australia |
Luo, Suhuai | Univ. of Newcastle, Australia |
Ni, Zefeng | Univ. of Tech. Sydney |
Keywords: Classification and Clustering, Image and Video Processing, Features and Image Descriptors
Abstract: In this paper, we originally propose an inference bag of features (BoF) method for image classification. Current BoF methods construct visual word dictionary (VWD) from training images. More training data are desired for higher classification rate. However, more training data increase size of visual word dictionary (VWD) as well as testing time. Fixed size of VWD in current methods guarantee processing speed, but would miss available training data. Our method addresses this dilemma. We use three sets of images: training, inference and testing images. Using sparse coding, VWD is constructed from inference images, the amount of which is fixed. Posterior probabilities of visual words over classes are learned from training images in a Bayesian framework. In testing, testing images are represented by visual words in VWD. The choices of representing visual words determine classification decision. We compared our method with two popular methods on gender classification and vehicle type classification. We achieved promising results.
|
|
08:30-09:00, Paper ThPSAT1.29 | |
Efficient Sequence Kernel-Based Genome-Wide Prediction of Transcription Factors |
Kuksa, Pavel | NEC Lab. America Inc. |
Keywords: Pattern Recognition for Bioinformatics, Classification and Clustering, Machine Learning and Data Mining
Abstract: With whole genome sequences of many organisms readily available, and lack of full functional characterization of the genes, computational functional analysis of whole genomes is a target of intensive research. Of a particular interest is prediction of regulatory functions, such as regulation of gene expression by transcription factors (TFs), proteins that bind to DNA to promote or suppress transcription of their target genes. Identification of these transcription factors at the genome level (i.e. from their sequence) lays a basis for further analysis and understanding of gene regulatory networks and can serve as a starting point for targeted high-throughput experiments. In this work, we address a question of predicting whether a (uncharacterized) protein is a transcription factor or not given its amino acid sequence. We cast this problem as classification task: we use sequence features as input variables and output functional class (TF or non-TF). We show that our proposed method can identify with high accuracy TFs at whole genome level both within given organism and across different organisms, as well as identify novel TF families with high accuracy.
|
|
08:30-09:00, Paper ThPSAT1.30 | |
Predicting Battery Life from Usage Trajectory Patterns |
Takahashi, Toshihiro | IBM |
Ide, Tsuyoshi | IBM |
Keywords: Machine Learning and Data Mining, Feature Reduction and Manifold Learning, Statistical, Syntactic and Structural Pattern Recognition
Abstract: This paper addresses the task of predicting the battery capacity degradation ratio for a given usage pattern. This is an interesting pattern recognition task, where each usage pattern is represented as a trajectory in a feature space, and the prediction model captures the previous usage trajectory patterns. The main technical challenge here is how to build a good model from a limited number of training samples. To tackle this, we introduce a new smoothing technique in the trajectory space. The trajectory smoothing technique is shown to be equivalent of a novel regularization scheme for linear regression. Using real Li-ion battery data, we show that our approach outperforms existing methods.
|
|
08:30-09:00, Paper ThPSAT1.31 | |
An Improved K-Means Document Clustering Using Wikipedia Hierarchical Ontology |
Hassan, Mostafa | Centre for Pattern Analysis and MachineIntelligence(CPAMI),Univ. |
Karray, Fakhri | Univ. of waterloo |
Kamel, Mohamed S | Univ. of Waterloo |
Keywords: Machine Learning and Data Mining, Classification and Clustering, Statistical, Syntactic and Structural Pattern Recognition
Abstract: Text document clustering is one of the crucial tasks in text mining. It is used in many different text mining applications. One of the most commonly used algorithms for document clustering is the k-means algorithm, the main drawback of which is that its output performance is very sensitive to its initial clusters� centroids. In this work, we present a technique to initialize the centroids based on background knowledge structure extracted from one of the largest online knowledge repositories: Wikipedia. Results show that the proposed model is efficient, and promising, as it outperforms the accuracy of the conventional k-means clustering, as well as other conventional algorithms for document clustering.
|
|
08:30-09:00, Paper ThPSAT1.32 | |
Multi-Task Co-Clustering Via Nonnegative Matrix Factorization |
Xie, Saining | Shanghai Jiao Tong Univ. |
Lu, Hongtao | Shanghai Jiao Tong Univ. |
He, Yangcheng | Shanghai Jiao Tong Univ. |
Keywords: Machine Learning and Data Mining, Classification and Clustering, Document Understanding
Abstract: Recent results have empirically proved that, given several related tasks with different data distributions and an algorithm that can utilize both the task-specific and cross-task knowledge, clustering performance of each task can be significantly enhanced. This kind of unsupervised learning method is called multi-task clustering. We focus on tackling the multi-task clustering problem via a 3-factor nonnegative matrix factorization. The object of our approach consists of two parts: (1) Within-task co-clustering: co-cluster the data in the input space individually. (2) Cross-task regularization: Learn and refine the relations of feature spaces among different tasks. We show that our approach has a sound information theoretic background and the experimental evaluation shows that it outperforms many state-of-the-art single-task or multi-task clustering methods.
|
|
08:30-09:00, Paper ThPSAT1.33 | |
Designing Various Component Analysis at Will |
Kimura, Akisato | NTT Corp. |
Sugiyama, Masashi | Tokyo Inst. of Tech. |
Sakano, Hitoshi | NTT |
Kameoka, Hirokazu | NTT Corp. |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, Machine Learning and Data Mining, Feature Reduction and Manifold Learning
Abstract: This paper provides a generic framework of component analysis (CA) methods introducing a new expression for scatter matrices and Gram matrices, called Generalized Pairwise Expression (GPE). This expression is quite compact but highly powerful. The framework includes not only (1) the standard CA methods but also (2) several regularization techniques, (3) weighted extensions, (4) some clustering methods, and (5) their semi-supervised extensions. This paper also presents quite a simple methodology for designing a desired CA method from the proposed framework: Adopting the known GPEs as templates, and generating a new method by combining these templates appropriately.
|
|
08:30-09:00, Paper ThPSAT1.34 | |
Hierarchical Multilevel Object Recognition Using Markov Model |
Attamimi, Muhammad | The Univ. of Electro-Communications |
Nakamura, Tomoaki | The Univ. of Electro-Communications |
Nagai, Takayuki | The Univ. of Electro-Communications |
Keywords: Statistical, Syntactic and Structural Pattern Recognition, 2D/3D Object Detection and Recognition, Vision for Robotics
Abstract: In this study, we address the issue on multilevel object recognition. The multilevel object recognition is object recognition in various levels, that is, simultaneous recognition of its instance, category, material, etc. At each level, many recognition methods have been proposed in the literature. Therefore it is straightforward to design a multilevel object recognition system using conventional methods independently. However, these "levels" are related each other and form hierarchical structure. Hence the recognition performance can be improved by considering consistency of the recognition results at all levels. To model the consistency, we formulate the problem as finding the Viterbi path in a Markov model, since the consistent recognition results can be thought of as the most likely sequence of the states. We implemented the proposed multilevel object recognition system and evaluated it to show validity.
|
|
08:30-09:00, Paper ThPSAT1.35 | |
A Linear Max K-Min Classifier |
Dong, Mingzhi | Beijing Univ. of Posts and Telecommunications |
Yin, Liang | Beijing Univ. of Posts and Telecommunications |
Deng, Weihong | Beijing Univ. of Posts and Telecommunications |
Wang, Qiang | Beijing Univ. of Posts and Telecommunications |
Yuan, Caixia | Beijing Univ. of Posts and Telecommunications(BUPT).P.R.Ch |
Guo, Jun | Beijing Univ. of Posts and Telecommunications |
Shang, Li | Intel China Res. Center |
Ma, Liwei | Intel China Res. Center |
Keywords: Classification and Clustering
Abstract: Over the past decades, the mathematical modeling of classifier has always been a hot topic in the field of pattern recognition. Maximin classifier, which pays strong attention to the worst instance of each class, has achieved excellent performance in a great number of applications. However, the maximin classifiers only consider the most boundary point/points of each class. Thus this paper proposes a more robust Linear Max K-min (LMKM) Classifier for 2-class classification problems by finding a hyperplane which best classifies K-worst cases. The original objective function is reformulated into a linear programming problem with 2N constraints which can be solved with high computational efficiency, where N indicates the number of training samples. Our algorithm is tested in 18 publicly available 2-class classification datasets and the experiment results show that the classification performance of LMKM is competitive with Linear Support Vector Machine (SVM) and Logistic Regression (LR).
|
|
08:30-09:00, Paper ThPSAT1.36 | |
A Dual-Staged Classification-Selection Approach for Automated Update of Biometric Templates |
Rattani, Ajita | Univ. of Cagliari |
Marcialis, Gian Luca | Univ. of Cagliari |
Granger, Eric | �cole de Tech. sup�rieure |
Roli, Fabio | Univ. of Cagliari |
Keywords: Biometrics
Abstract: In the emerging field of adaptive biometrics, systems aim to adapt enrolled templates to variations in samples observed during operations. However, despite numerous advantages, few commercial vendors have adopted auto-update procedures in their products. This is due to limitations associated with existing adaptation schemes. This paper proposes a dual-staged template adaptation scheme that allows to capture 'informative' operational samples with significant variations but without increasing the vulnerability to impostor intrusion. This is achieved through a two staged classification-selection approach driven by the harmonic function and risk minimization technique, over a graph based representation of (enrolment and operational) samples. Experimental results on the DIEE fingerprint data set, explicitly collected for evaluating adaptive biometric systems, demonstrate that the proposed scheme results in 67% reduction in error over the baseline system (without adaptation), outperforming state-of-the-art methods.
|
|
08:30-09:00, Paper ThPSAT1.37 | |
The Bayesian Logistic Regression in Pattern Recognition Problems under Concept Drift |
Turkov, Pavel | Tula State Univ. |
Krasotkina, Olga | Tula State Univ. |
Mottl, Vadim | Computing Center of the Russian Acad. of Sciences |
Keywords: Machine Learning and Data Mining, Classification and Clustering
Abstract: The practice always makes us face the challenge of processing pattern recognition data flows with time-varying target concept, i.e., changing statistical relationship between class memberships and observable characteristics of entities to be perceived by the recognition system. In this paper, a mathematical and algorithmic framework is proposed for handling the concept drift in pattern recognition problems on the basis of the Bayesian treatment of logistic regression as an appropriate mathematical instrument for inferring a time-varying decision rule. The pattern recognition procedure resulting from this approach is a numerical implementation of the general dynamic programming principle, and has the linear computational complexity with respect to the length of the time series, in contrast to the polynomial complexity of pattern recognition procedures of general kind.
|
|
08:30-09:00, Paper ThPSAT1.38 | |
Adaptive Selection of Ensembles for Imbalanced Class Distributions |
Radtke, Paulo Vinicius Wolski | �cole de Tech. sup�rieure |
Granger, Eric | �cole de Tech. sup�rieure |
Sabourin, R. | �cole de Tech. sup�rieure |
Gorodnichy, Dmitry | Canada Border Services Agency |
Keywords: Pattern Recognition for Surveillance and Security, Biometrics
Abstract: Boolean combination (BC) techniques have been shown to efficiently integrate the responses of multiple diversified classifiers in the ROC space to improve the overall accuracy and reliability of pattern recognition systems. In practice, since class distributions are often imbalanced and change over time, the BC of classifiers, and thus selection of ensembles, should be adapted to reflect operational conditions. Although the impact on classification performance of imbalanced distributions may be addressed using ensemble-based techniques, this is difficult to observe from ROC curves. However, given a desired false positive rate and class imbalance, performing BC in the Precision-Recall Operating Characteristic (PROC) space with skewed data may lead to a higher level of performance. In this paper, an adaptive system is proposed that initially generates several PROC curves, each one from data with a different level of skew. Then, during operations, the class imbalance is periodically estimated, and used to approximate the most accurate BC of classifiers among operational points of these curves. Simulation results indicate that this approach maintains a high level of accuracy that is comparable to full Boolean re-combination (as required for a specific level of imbalance), but for a significantly lower computational cost.
|
|
08:30-09:00, Paper ThPSAT1.39 | |
Applying Scattering Operators for Face Recognition: A Comparative Study |
Chang, Kuang-Yu | Acad. Sinica |
Lin, Cheng-Fu | Res. center for Information Tech. Innovation, Acad. |
Chen, Chu-Song | Acad. Sinica |
Hung, Yi-Ping | National Taiwan Univ. |
Keywords: Pattern Recognition for Bioinformatics
Abstract: Face identification is the problem of determining whether two face images depict the same person or not. This is difficult due to variations in scale, pose, lighting, background, expression, hairstyle, and glasses. Thus, a powerful feature descriptor with local-deformation tolerance ability and discriminating capability is essential to fulfill all these variations. In this paper, we present a local descriptor, scattering operator, which includes multi-scale and multi-direction co-occurrence information. It is computed with a cascade of wavelet decompositions and complex modulus. This scattering representation is locally translation invariant and can linearize deformations. We evaluate the abilities of this Gabor-based scattering operator by an effective face recognition paradigm and show that this descriptor outperforms the compared descriptors.
|
|
08:30-09:00, Paper ThPSAT1.40 | |
Single-Frame Hand Gesture Recognition Using Color and Depth Kernel Descriptors |
Zhu, Xiaolong | The Univ. of Hong Kong |
Wong, Kwan-Yee Kenneth | The Univ. of Hong Kong |
Keywords: Gesture and Behavior Analysis, Features and Image Descriptors, Human Computer Interaction
Abstract: This paper presents a flexible method for single-frame hand gesture recognition by fusing information from color and depth images. Existing methods usually focus on designing intuitive features for color and depth images. On the contrary, our method first extracts common patch-level features, and fuses them by means of kernel descriptors. Linear SVM is then adopted to predict the class label efficiently. In our experiments on two American Sign Language (ASL) datasets, we demonstrate that our approach recognizes each sign accurately with only a small number of training samples, and is robust to the change of distance between the hand and the camera.
|
|
08:30-09:00, Paper ThPSAT1.41 | |
An Efficient Method for Occluded Face Recognition |
Liu, Wentao | Tsinghua Univ. |
Xie, Xudong | Tsinghua Univ. |
Lam, Kin-Man | The Hong Kong Pol. Univ. |
Keywords: Biometrics
Abstract: During the last two decades, a series of subspace methods have succeeded in achieving a satisfactory performance for face recognition tasks, but have always failed when partial occlusions occur. This paper combines the subspace techniques with probabilistic models, and aims at achieving invariance to occlusions. The concept underlying the proposed method is that two faces with the same identity, even though one of them is partially occluded, tend to be similar in the uncorrupted areas. The similarity value measured from the error distributions can then be exploited for identification. Experiments show the robustness of this novel method against various kinds of occlusion.
|
|
08:30-09:00, Paper ThPSAT1.42 | |
Collaborative PLSA for Multi-View Clustering |
Jiang, Yu | Inst. of Automation, Chinese Acad. of Science |
Liu, Jing | National Lab. of Pattern Recognition,Inst. |
Li, Zechao | Inst. of Automation, Chinese Acad. of Science |
Lu, Hanqing | Inst. of Automation,Chinese Acad. of Science |
Keywords: Classification and Clustering, Machine Learning and Data Mining
Abstract: In real world, data has multi-view representations from different feature spaces. Multi-view clustering algorithms allow leveraging information from multiple views of the data and this may substantially improve the clustering result obtained by using a single view. In this paper, we propose a novel algorithm called Collaborative PLSA (C-PLSA) for multi-view clustering, which works on the assumption that the clustering from one view should agree with the clustering from another view. The proposed C-PLSA combines individual PLSA models on two different views, and imports a regularizer to force the both clustering results agree across the two views. To solve the regularized problem, an alternating optimization algorithm based on generalized EM (GEM) is adopted for maximum likelihood estimation. Experiments on two real-world datasets, i.e., Reuters multilingual text and Corel images, demonstrate the improved performance of our proposed method over some related work.
|
|
08:30-09:00, Paper ThPSAT1.43 | |
A Cross-Device Matching Fingerprint Database from Multi-Type Sensors |
Jia, Xiaofei | Insititute of Automation, Chinese Acad. of Sciences |
Yang, Xin | Insititute of Automation, Chinese Acad. of Sciences |
Zang, Yali | Inst. of Automation, Chinese Acad. of Sciences |
Zhang, Ning | Insititute of Automation, Chinese Acad. of Sciences |
Tian, Jie | Insititute of Automation, Chinese Acad. of Sciences |
Keywords: Biometrics
Abstract: Databases play an important role in evaluating the performance of fingerprint identification algorithms. But which can be used to test the interoperability? That is to say, few of databases can test the performance of an algorithm on images acquired by different sensors. In order to solve the problem, we create the FingerPass cross-device matching fingerprint database which consists of almost 80 thousand fingerprint images from 90 subjects on nine different fingerprint sensors. We take both technology type and interaction type into consideration when choosing the sensors, totally different from other databases. It can test the interoperability of an algorithm at both the sensor level and the sensor type level. So we can use the FingerPass to test the performance of a cross-device matching algorithm for sensors of a specific type or different types . We apply the VeriFinger fingerprint recognition algorithm on it, and the experimental results indicate that the FingerPass cross-device matching database is a challenge for fingerprint algorithms.
|
|
08:30-09:00, Paper ThPSAT1.44 | |
Object Categorization Via Sparse Representation of Local Features |
Wang, Jin | Deakin Univ. Australia |
Sun, Xiangping | Deakin Univ. |
Chen, Ronghua | Deakin Univ. |
She, Mary Fenghua | Deakin Univ. |
Wang, Qiang | CSR Zhuzhou Inst. CO.,LTD, China |
Keywords: Classification and Clustering, Features and Image Descriptors, Image and Video Understanding
Abstract: Sparse representation has been introduced into the computer vision to address many recognition problems. In this paper, we propose a new framework for object categorization based on sparse representation of local features. Unlike most of previous sparse coding based methods in object classification that only use sparse coding to extract high-level features, the proposed method incorporates sparse representation and classification into a unified framework. Therefore, it does not need a further classifier. Experimental results show that the proposed method achieved better or comparable accuracy than the well known bag-of-features representation with various classifiers.
|
|
08:30-09:00, Paper ThPSAT1.45 | |
Hash-Based Structural Similarity for Semi-Supervised Learning on Attribute Graphs |
Hido, Shohei | Preferred Infrastructure |
Kashima, Hisashi | The Univ. of Tokyo |
|
|
08:30-09:00, Paper ThPSAT1.46 | |
Commensurate Dimensionality Reduction for Extended Local Ternary Patterns |
Liao, Wen-Hung | National Chengchi Univ. |
Keywords: Feature Reduction and Manifold Learning, Classification and Clustering
Abstract: We present a systematic approach to reduce the dimensionality of the feature vector for local binary/ternary patterns. The proposed framework examines the distribution of uniform patterns in different image sets to formulate a procedure to assign dimensionality to uniform and non-uniform patterns. Unlike previous methods where all the information from non-uniform patterns is discarded or merged into a single dimension, the proposed commensurate dimensionality reduction (CDR) technique attempts to retain valuable information from all contributory factors. Experiments and comparative analysis have validated the efficacy of the newly defined CDR-ELTP descriptor in terms of noise resistance and texture classification.
|