WhatToSee
- -Tolerance Closed Frequent Itemsets
- 1
- 1
- A Balanced Ensemble Approach to Weighting Classifiers for Text Classification
- A Data Mining Approach for Capacity Building of Stakeholders in Integrated Flood Management Peter Owotoki, Natasa Manojlovi , Friedrich Mayer-Lindenberg, Erik Pasche
- A Feature Selection and Evaluation Scheme for Computer Virus Detection
- A Framework for Regional Association Rule Mining in Spatial Datasets
- A Novel Scalable Algorithm for Supervised Subspace Learning Jun Yan1, Ning Liu1, Benyu Zhang1, Qiang Yang2, Shuicheng Yan3, Zheng Chen1 Microsoft Research Asia, 49 Zhichun Road, Beijing 100080, P.R. China {junyan, ningl, byzhang, zhengc}@microsoft.com Department of Computer Science, Hong Kong University of Science and Technology qyang@cs.ust.hk ECE department, University of Illinois at Urbana Champaign, USA
- A Parameterized Probabilistic Model of Network Evolution for Supervised Link Prediction
- A Simple Yet Effective Data Clustering Algorithm Soujanya Vadapalli Satyanarayana R Valluri Kamalakar Karlapalem Center for Data Engineering, IIIT, Hyderabad, INDIA {soujanya, satya, kamal}@iiit.ac.in
- AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery Hong Cheng Philip S. Yu Jiawei Han
- Accelerating Newton Optimization for Log-Linear Models through Feature Redundancy
- Active Learning to Maximize Area Under the ROC Curve Matt Culver, Deng Kun, and Stephen Scott Dept. of Computer Science 256 Avery Hall University of Nebraska Lincoln, NE 68588-0115 {mculver,kdeng,sscott}@cse.unl.edu
- Adaptive Blocking: Learning to Scale Up Record Linkage
- Adaptive Kernel Principal Component Analysis With Unsupervised Learning of Kernels Daoqiang Zhang Zhi-Hua Zhou National Laboratory for Novel Software Technology Nanjing University, Nanjing 210093, China {zhangdq, zhouzh}lamda.nju.edu.cn Songcan Chen Department of Computer Science and Engineering NUAA, Nanjing 210016, China s.chen@nuaa.edu.cn
- Adaptive Parallel Graph Mining for CMP Architectures
- Adding Semantics to Email Clustering Hua Li1, Dou Shen2, Benyu Zhang1, Zheng Chen1, Qiang Yang2
- An Efficient Reference-based Approach to Outlier Detection in Large Datasets
- An Experimental Investigation of Graph Kernels on a Collaborative Recommendation Task Francois Fouss, Luh Yen, Alain Pirotte & Marco Saerens Information Systems Research Unit (ISYS) Universite´ catholique de Louvain
- An Information Theoretic Approach to Detection of Minority Subsets in Database Shin Ando Graduate School of Engineering, Yokohama National University ando@ubicg.ynu.ac.jp Einoshin Suzuki Graduate School of Information Science and Electric Engineering, Kyushu University suzuki@i.kyushu-u.ac.jp
- An Interactive Semantic Video Mining and Retrieval Platform Application in Transportation Surveillance Video for Incident Detection Xin Chen and Chengcui Zhang Department of Computer and Information Sciences, University of Alabama at Birmingham Birmingham, Alabama, 35294 USA {chenxin, zhang}@cis.uab.edu
- Anytime Classification Using the Nearest Neighbor Algorithm with Applications to Stream Mining
- Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval Xiangji Huang1, Yan Rui Huang2, Miao Wen2, Aijun An2, Yang Liu2 and Josiah Poon3
- Automatic Single-Organ Segmentation in Computed Tomography Images
- Bayesian State Space Modeling Approach for Measuring the Effectiveness of Marketing Activities and Baseline Sales from POS Data Tomohiro Ando Graduate School of Business Administration Keio University 2-1-1 Hiyoshi-Honcho, Kohoku-ku, Yokohama-shi, Kanagawa, 223-8523, Japan andoh@kbs.keio.ac.jp
- Belief Propagation in Large, Highly Connected Graphs for 3D Part-Based Object Recognition Frank DiMaio and Jude Shavlik Computer Sciences Dept. University of WisconsinMadison Madison, WI 53706 {dimaio,shavlik}@cs.wisc.edu
- Biclustering Protein Complex Interactions with a Biclique Finding Algorithm
- Boosting Kernel Models for Regression Ping Sun and Xin Yao School of Computer Science University of Birmingham
- Boosting for Learning Multiple Classes with Imbalanced Class Distribution
- Boosting the Feature Space: Text Classification for Unstructured Data on the Web Yang Song1, Ding Zhou1, Jian Huang2, Isaac G. Councill2,
- COALA : A Novel Approach for the Extraction of an Alternate Clustering of High Quality and High Dissimilarity Eric Bae and James Bailey NICTA Victoria Laboratory Department of Computer Science and Software Engineering University of Melbourne, Australia {kheb,jbailey}@csse.unimelb.edu.au
- COSMIC: Conceptually Specified Multi-Instance Clusters Hans-Peter Kriegel Alexey Pryakhin Matthias Schubert Arthur Zimek Institute for Informatics, Ludwig-Maximilians-Universitat Munchen, Germany http://www.dbs.ifi.lmu.de {kriegel,pryakhin,schubert,zimek }@dbs.ifi.lmu.de
- Chao Liu Dept. of Computer Science University of Illinois-UC Urbana, IL 61801 chaoliu@cs.uiuc.edu
- Cluster Analysis of Time-series Medical Data Based on the Trajectory Representation and Multiscale Comparison Techniques Shoji Hirano Shusaku Tsumoto Department of Medical Informatics, Shimane University, School of Medicine 89-1 Enya-cho, Izumo, Shimane 693-8501, Japan hirano@ieee.org
- Cluster Based Core Vector Machine M Narasimha Murty S K Shevade
- Co-clustering documents and words using Bipartite Isoperimetric Graph Partitioning
- CoMiner: An Effective Algorithm for Mining Competitors from the Web
- Comparison of Descriptor Spaces for Chemical Compound Retrieval and Classification
- Comparisons of K-Anonymization and Randomization Schemes Under Linking Attacks Zhouxuan Teng and Wenliang Du Department of Electrical Engineering and Computer Science Syracuse University, Syracuse, USA Email: zhteng@syr.edu,wedu@ecs.syr.edu
- Converting Output Scores from Outlier Detection Algorithms into Probability Estimates
- Corrective Classification: Classifier Ensembling with Corrective and Diverse Base Learners
- DSTree: A Tree Structure for the Mining of Frequent Sets from Data Streams Carson Kai-Sang Leung Quamrul I. Khan
- Data Mining Approaches to Criminal Career Analysis
- Decision Trees for Functional Variables
- Deploying Approaches for Pattern Refinement in Text Mining Sheng-Tang Wu Yuefeng Li Yue Xu School of Software Engineering and Data Communications Queensland University of Technology, QLD 4001 Australia {s.wu, y2.li, yue.xu}@qut.edu.au
- Detecting Link Spam using Temporal Information Guoyang Shen1,2*, Bin Gao1*, Tie-Yan Liu1, Guang Feng1,3*, Shiji Song2, Hang Li1
- Detection of Interdomain Routing Anomalies Based on Higher-Order Path Analysis
- Dimension Reduction for Supervised Ordering Toshihiro Kamishima and Shotaro Akaho National Institute of Advanced Industrial Science and Technology (AIST) AIST Tsukuba Central 2, Umezono 111, Tsukuba, Ibaraki, 3058568 Japan, mail@kamishima.net (http://www.kamishima.net/) and s.akaho@aist.go.jp
- Direct Marketing When There Are Voluntary Buyers
- Dirichlet Aspect Weighting: A Generalized EM algorithm for Integrating External Data Fields with Semantically Structured Queries by using Gradient Projection Method Atulya Velivelli and Thomas S. Huang Dept. of Electrical and Computer Engineering Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign Urbana, IL 61801, U.S.A. {velivell, huang}@ifp.uiuc.edu
- Discover Bayesian Networks from Incomplete Data Using a Hybrid Evolutionary Algorithm
- Discovering Partial Orders in Binary Data
- Discovering Unrevealed Properties of Probability Estimation Trees: on Algorithm Selection and Performance Explanation
- Discovery of Collocation Episodes in Spatiotemporal Data Huiping Cao, Nikos Mamoulis, and David W. Cheung Department of Computer Science The University of Hong Kong Pokfulam Road, Hong Kong {hpcao,nikos,dcheung}@cs.hku.hk
- Distances and (Indefinite) Kernels for Sets of Objects Adam Woznica, Alexandros Kalousis, Melanie Hilario ´ University of Geneva, Computer Science Department Rue General Dufour 24, 1211 Geneva 4, Switzerland {woznica, kalousis, hilario}@cui.unige.ch
- Diverse Topic Phrase Extraction through Latent Semantic Analysis
- Efficient Clustering of Uncertain Data
- Enhancing Text Clustering using Concept-based Mining Model Shady Shehata Fakhri Karray Mohamed Kamel Department of Electrical and Computer Engineering University of Waterloo Waterloo, Ontario, Canada N2L 3G1 {shady, karray, mkamel}@pami.uwaterloo.ca
- Entity Resolution with Markov Logic Parag Singla Pedro Domingos Department of Computer Science and Engineering University of Washington Seattle, WA 98195-2350, U.S.A. parag,pedrod@cs.washington.edu
- Entropy-based Concept Shift Detection Peter Vorburger, Abraham Bernstein University of Zurich Department of Informatics Binzmuhlestrasse 14, 8050 Zurich, Switzerland {vorburger, bernstein}@ifi.unizh.ch
- Exploratory Under-Sampling for Class-Imbalance Learning
- Fast On-line Kernel Learning for Trees
- Fast Relevance Discovery in Time Series
- Fei Wang Department of Automation Tsinghua University Beijing, 100084, P.R.China feiwang03@gmail.com Sheng Ma, Liuzhong Yang Vivido Media (Beijing) Inc. Shangdi Development Zone Beijing 100085, China masheng@vividomedia.com.cn Tao Li School of Computer Science Florida International University Miami, FL 33199 taoli@cs.fiu.edu
- Finding "Who Is Talking to Whom" in VoIP Networks via Progressive Stream Clustering
- Forecasting Skewed Biased Stochastic Ozone Days: Analyses and Solutions Kun Zhang1, Wei Fan2, Xiaojing Yuan3, Ian Davidson4,and Xiangshang Li5
- Frequent Closed Itemset Mining Using Prefix Graphs with an Efficient Flow-Based Pruning Strategy H. D. K. Moonesinghe, Samah Fodeh, Pang-Ning Tan Department of Computer Science & Engineering Michigan State University East Lansing, MI 48824 (moonesin, fodehsam, ptan)@cse.msu.edu
- Geometrically Inspired Itemset Mining Florian Verhein, Sanjay Chawla School of Information Technologies, University of Sydney, Australia {fverhein,chawla}@it.usyd.edu.au
- Getting the Most Out of Ensemble Selection Rich Caruana, Art Munson, Alexandru Niculescu-Mizil Department of Computer Science, Cornell University {caruana, mmunson, alexn} @cs.cornell.edu
- Global and Componentwise Extrapolation for Accelerating Data Mining from Large Incomplete Data Sets with the EM Algorithm Chun-Nan Hsu Han-Shen Huang Bo-Hou Yang Institute of Information Science Academia Sinica Nankang, Taipei, Taiwan {chunnan,hanshen,ericyang}@iis.sinica.edu.tw
- Gradual Cube: Customize Profile on Mobile OLAP Jun Li Haofeng Zhou Wei Wang Department of Computing and Information Technology Fudan University Shanghai 200433 042021117@fudan.edu.cn
- GraphRank: Statistical Modeling and Mining of Significant Subgraphs in the Feature Space Huahai He Ambuj K. Singh Department of Computer Science University of California, Santa Barbara Santa Barbara, CA 93106, USA {huahai, ambuj}@cs.ucsb.edu
- Hierarchical Classification by Expected Utility Maximization Korinna Bade, Eyke Hullermeier, Andreas Nurnberger Otto-von-Guericke-University Magdeburg, D-39106 Magdeburg, Germany {kbade,nuernb}@iws.cs.uni-magdeburg.de huellerm@iti.cs.uni-magdeburg.de
- High Quality, Efficient Hierarchical Document Clustering using Closed Interesting Itemsets Hassan H. Malik and John R. Kender Department of Computer Science, Columbia University {hhm2104, jrk}@cs.columbia.edu
- High-Performance Unsupervised Relation Extraction from Large Corpora Binjamin Rozenfeld, Ronen Feldman Bar-Ilan University, Ramat Gan grurgrur@gmail.com, ronenf@gmail.com
- Improving Grouped-Entity Resolution using Quasi-Cliques
- Improving Nearest Neighbor Classifier using Tabu Search and Ensemble Distance Metrics Muhammad Atif Tahir and James Smith School of Computer Science University of the West of England BS16 1QY, Bristol, United Kingdom {muhammad.tahir,james.smith@uwe.ac.uk}
- Improving Personalization Solutions through Optimal Segmentation of Customer Bases Tianyi Jiang, Alexander Tuzhilin New York University tjiang, atuzhili@stern.nyu.edu
- Incremental Mining of Frequent Query Patterns from XML Queries for Caching
- Integrating Features from Different Sources for Music Information Retrieval
- Intelligent Icons: Integrating Lite-Weight Data Mining and Visualization into GUI Operating Systems Eamonn Keogh Li Wei Xiaopeng Xi Stefano Lonardi Jin Shieh Scott Sirowy University of California Riverside {eamonn, wli, xxi, stelo, shiehj, ssirowy}@cs.ucr.edu
- Keyphrase Extraction using Semantic Networks Structure Analysis Chong Huang1, Yonghong Tian2, Zhi Zhou2, Charles X. Ling3 , Tiejun Huang1 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China
- Large Scale Detection of Irregularities in Accounting Data Stephen Bay, Krishna Kumaraswamy, Markus G. Anderle, Rohit Kumar, David M. Steier Center for Advanced Research, PricewaterhouseCoopers LLP 10 Almaden Blvd, Suite 1600, San Jose, CA 95113 {firstname.initial.lastname}@us.pwc.com
- Latent Dirichlet Co-Clustering M. Mahdi Shafiei and Evangelos E. Milios Faculty of Computer Science, Dalhousie University 6050 University Ave., Halifax, Canada shafiei@cs.dal.ca , eem@cs.dal.ca
- Latent Friend Mining from Blog Data Dou Shen1, Jian-Tao Sun2, Qiang Yang1, Zheng Chen2 Hong Kong University of Science and Technology, Hong Kong {dshen, qyang}@cse.ust.hk
- Learning to Use a Learned Model: A Two-Stage Approach to Classification
- Linear and Non-Linear Dimensional Reduction via Class Representatives for Text Classification
- Local Correlation Tracking in Time Series Spiros Papadimitriou Jimeng Sun Philip S. Yu
- MARGIN: Maximal Frequent Subgraph Mining
- Manifold Clustering of Shapes
- Meta Clustering Rich Caruana, Mohamed Elhawary, Nam Nguyen, Casey Smith Cornell University, Ithaca, New York 14853 {caruana, hawary, nhnguyen, casey}@cs.cornell.edu
- Minimum Enclosing Spheres Formulations for Support Vector Ordinal Regression
- Mining Complex Time-Series Data by Learning Markovian Models
- Mining Correlation between Motifs and Gene Expression Yi Lu, Shiyong Lu, Adrian E. Platts, Stephen A. Krawetz Wayne State University
- Mining Generalized Graph Patterns based on User Examples1
- Mining Latent Associations of Objects Using a Typed Mixture Model --A case study on expert/expertise mining Shenghua BaoI, Yunbo Cao, Bing Liu, Yong YuI, and Hang Li
- Mining Maximal Generalized Frequent Geographic Patterns with Knowledge Constraints
- Mining Maximal Quasi-Bicliques to Co-Cluster Stocks and Financial Ratios for Value Investment Kelvin Sim, Jinyan Li Vivekanand Gopalkrishnan
- Mining for Tree-Query Associations in a Graph Eveline Hoekx and Jan Van den Bussche Hasselt University and transnational University of Limburg Agoralaan D, 3590 Diepenbeek, Belgium {eveline.hoekx, jan.vandenbussche}@uhasselt.be
- Mixed-Drove Spatio-Temporal Co-occurrence Pattern Mining: A Summary of Results Mete Celik1 Shashi Shekhar1 James P. Rogers2 James A. Shine2 Jin Soung Yoo1
- Multi-Tier Granule Mining for Representations of Multidimensional Association Rules
- NewsCATS: A News Categorization And Trading System
- Object Identification with Constraints Steffen Rendle, Lars Schmidt-Thieme Department of Computer Science University of Freiburg, Germany steffen@rendle.de, lst@informatik.uni-freiburg.de
- On Trajectory Representation for Scientific Features
- On the Lower Bound of Local Optimums in K-Means Algorithm
- On the Use of Structure and Sequence-Based Features for Protein Classification and Retrieval Keith Marsolo and Srinivasan Parthasarathy Department of Computer Science and Engineering The Ohio State University Contact: srini@cse.ohio-state.edu
- Opening the Black Box of Feature Extraction: Incorporating Visualization into High-Dimensional Data Mining Processes* Jianting Zhang1, Le Gruenwald2 1 LTER Network Office, the University of New Mexico, Albuquerque, NM, 8713, USA 2 School of Computer Science, University of Oklahoma, Norman, OK 73071, USA and National Science Foundation, 4201 Blvd, Arlington, VA 22230, USA Contact Email: jzhang@lternet.edu, Phone: 1-505-277-0666
- Optimal Segmentation Using Tree Models Robert Gwadera, Aristides Gionis, and Heikki Mannila HIIT, Basic Research Unit Helsinki University of Technology and University of Helsinki Finland
- P3C: A Robust Projected Clustering Algorithm
- Pattern Mining in Frequent Dynamic Subgraphs Karsten M. Borgwardt, Hans-Peter Kriegel, Peter Wackersreuther Institute of Computer Science Ludwig-Maximilians-Universitat Munich, Germany kb|kriegel|wackersr@dbs.ifi.lmu.de
- Personalization in Context: Does Context Matter When Building Personalized Customer Models?
- Probabilistic Enhanced Mapping with the Generative Tabular Model
- Probabilistic Segmentation and Analysis of Horizontal Cells
- Query-Sensitive Similarity Measure for Content-Based Image Retrieval Zhi-Hua Zhou Hong-Bin Dai National Laboratory for Novel Software Technology Nanjing University, Nanjing 210093, China {zhouzh, daihb}@lamda.nju.edu.cn
- Rapid Identification of Column Heterogeneity
- Regularized Least Absolute Deviations Regression and an Efficient Algorithm for Parameter Tuning
- Relational Ensemble Classification
- Resource Management for Networked Classifiers in Distributed Stream Mining Systems Deepak S. Turaga Olivier Verscheure Upendra V. Chaudhari Lisa D. Amini IBM T.J. Watson Research Center Yorktown Heights, NY 10598
- Robert Jaschk
- Rule-Based Platform for Web User Profiling1
- SAXually Explicit Images: Finding Unusual Shapes Li Wei Eamonn Keogh Xiaopeng Xi Department of Computer Science and Engineering University of California, Riverside {wli, eamonn, xxi}@cs.ucr.edu
- STAGGER: Periodicity Mining of Data Streams using Expanding Sliding Windows
- Searching for Pattern Rules
- Secure Distributed k-Anonymous Pattern Mining
- Semantic Kernels for Text Classification based on Topological Measures of Feature Similarity
- Semantic Smoothing for Model-based Document Clustering Xiaodan Zhang, Xiaohua Zhou, Xiaohua Hu College of Information Science & Technology, Drexel University xzhang@ischool.drexel.edu, xiaohua.zhou@drexel.edu, thu@ischool.drexel.edu
- Semi-Supervised Kernel Regression* Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, Hong-Jiang Zhang
- Similarity of Temporal Query Logs Based on ARIMA Model
- Social Capital in Friendship-Event Networks Louis Licamele and Lise Getoor Computer Science Dept., University of Maryland College Park, MD 20742 USA {licamele,getoor}@cs.umd.edu
- Solution Path for Semi-Supervised Classification with Manifold Regularization Gang Wang Tao Chen Dit-Yan Yeung Frederick H. Lochovsky Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
- Speedup Clustering with Hierarchical Ranking
- Stability Region based Expectation Maximization for Model-based Clustering
- Star-Structured High-Order Heterogeneous Data Co-clustering based on Consistent Information Theory Bin Gao Tie-Yan Liu Wei-Ying Ma Microsoft Research Asia 4F, Sigma Center, No. 49, Zhichun Road Beijing, 100080, P. R. China {bingao, tyliu, wyma}@microsoft.com
- Subjectivity Categorization of Weblog with Part-Of-Speech Based Smoothing Shen Huang1 Jian-Tao Sun1 Xuanhui Wang2 Hua-Jun Zeng1 Zheng Chen1
- TOP-COP: Mining TOP-K Strongly Correlated Pairs in Large Databases
- Temporal Data Mining in Dynamic Feature Spaces
- The Influence of Class Imbalance on Cost-Sensitive Learning: An Empirical Study Xu-Ying Liu Zhi-Hua Zhou National Laboratory for Novel Software Technology Nanjing University, Nanjing 210093, China {liuxy, zhouzh}@lamda.nju.edu.cn
- The PDD Framework for Detecting Categories of Peculiar Data Mahesh Shrestha, Howard J. Hamilton, Yiyu Yao, Ken Konkel, and Liqiang Geng Department of Computer Science University of Regina {shresthm,hamilton,yyao,konkel1k,gengl}@cs.uregina.ca
- Turning Clusters into Patterns: Rectangle-based Discriminative Data Description Byron J. Gao Martin Ester School of Computing Science, Simon Fraser University, Canada bgao@cs.sfu.ca ester@cs.sfu.ca
- Using an Ensemble of One-Class SVM Classifiers to Harden Payload-based Anomaly Detection Systems Roberto Perdisci , Guofei Gu, Wenke Lee College of Computing, Georgia Institute of Technology, Atlanta, GA 30332, USA
- What is the dimension of your binary data? Taneli Mielikainen Aristides Gionis Heikki Mannila
- Who Thinks Who Knows Who? Socio-cognitive Analysis of Email Networks
- a