publications Hal Daumé III
about me
cv + bio
publications
research
teaching
software
photos
calendar
contact me
links
All of my publications are available here. My Erdös number is at most 4 (me to J. Langford to A. Blum to J. Spencer to P. Erdös).

Click here for a listing with summaries.

Thesis

newPractical Structured Learning for Natural Language Processing. Ph.D. Thesis. [PDF] [BIB] [html]

Journal Papers

newSearch-based Structured Prediction. Machine Learning Journal (Submitted). (With:John Langford and Daniel Marcu) [PDF] [BIB] [html]
newDomain Adaptation for Statistical Classifiers. Journal of Artificial Intelligence Research (2006). (With:Daniel Marcu) [PDF] [BIB]
Induction of Word and Phrase Alignments for Automatic Document Summarization. Computational Linguistics (2005). (With:Daniel Marcu) [PDF] [BIB]
A Bayesian Model for Supervised Clustering with the Dirichlet Process Prior. Journal of Machine Learning Research (2005). (With:Daniel Marcu) [PDF] [BIB]

Conference Papers

newFast search for Dirichlet process mixture models. Conference on AI and Statistics (2007) -- Preliminary version. [PDF] [BIB]
newBayesian Query-Focused Summarization. Association for Computational Linguistics 2006. (With:Daniel Marcu) [PDF] [BIB]
A Large-Scale Exploration of Effective Global Features for a Joint Entity Detection and Tracking Model. Human Language Technologies/Empirical Methods in NLP 2005. (With:Daniel Marcu) [PDF] [BIB]
Learning as Search Optimization: Approximate Large Margin Methods for Structured Prediction. International Conference on Machine Learning 2005. (With:Daniel Marcu) [PDF] [BIB]
A Phrase-Based HMM Approach to Document/Abstract Alignment. Empirical Methods in NLP 2004. (With:Daniel Marcu) [PDF] [BIB]
NP Bracketing by Maximum Entropy Tagging and SVM Reranking. Empirical Methods in NLP 2004. (With:Daniel Marcu) [PDF] [BIB]
Web Search Intent Induction via Automatic Query Reformulation. North American Association for Computational Linguistics 2004 Short Paper. (With:Eric Brill) [PDF] [BIB]
The Importance of Lexicalized Syntax Models for Natural Language Generation Tasks. International Conference on Natural Language Generation 2002. (With:Kevin Knight, Irene Langkilde-Geary, Daniel Marcu and Kenji Yamada) [PDF] [BIB]
A Noisy-Channel Model for Document Compression. Association for Computational Linguistics 2002. (With:Daniel Marcu) [PDF] [BIB]
Integrated Information Management: An Interactive, Extensible Architecture for Information Retrieval. Human Language Technologies 2001. (With:Eric Nyberg) [PDF] [BIB]

Workshop Papers

Search-Based Structured Prediction as Classification. NIPS 2005 Workshop on Advances in Structured Learning for Text and Speech Processing. (With:John Langford and Daniel Marcu) [PDF] [BIB]
Bayesian Summarization and DUC and a Suggestion for Extrinsic Evaluation. Document Underanding Conference (DUC) 2005. (With:Daniel Marcu) [PDF] [BIB]
Bayesian Multi-Document Summarization at MSE. ACL 2005 Workshop on Multilingual Summarization Evaluation (MSE). (With:Daniel Marcu) [PDF] [BIB]
Supervised clustering with the Dirichlet process. NIPS 2004 Learning With Structured Outputs Workshop. (With:Daniel Marcu) [PDF] [BIB]
Generic Sentence Fusion is an Ill-Defined Summarization Task. Text Summarization Branches Out Workshop (ACL 2004). (With:Daniel Marcu) [PDF] [BIB]
A Tree-Position Kernel for Document Compression. Document Underanding Conference (DUC) 2004. (With:Daniel Marcu) [PDF] [BIB]
GLEANS: A Generator of Logical Extracts and Abstracts for Nice Summaries. Document Underanding Conference (DUC) 2002. (With:Abdesammad Echihabi, Daniel Marcu, Dragos Stefan Munteanu and Radu Soricut) [BIB]

Book Review

Book Review: Automatic Summarization (by Inderjeet Mani). Machine Translation. [PDF]

Unpublished Papers

The following papers are published anywhere, nor have they been peer reviewed. I put them up because I think (hope!) people might find them useful.
newSearn in Practice. (With:John Langford and Daniel Marcu) [PDF] [BIB] [html]
Carefully Approximated Bayes Factors for Feature Selection in MaxEnt Models. [PDF] [BIB]
Notes on CG and LM-BFGS Optimization of Logistic Regression. [PDF] [BIB]
Support Vector Machines for Natural Language Processing. [PDF]
From Zero to Reproducing Kernel Hilbert Spaces in Twelve Pages or Less. [PDF] [BIB]
Yet Another Haskell Tutorial. [PDF] [BIB] [html]
A Phrase-Based HMM. [PDF] [BIB]
Some notes on binning for Good-Turing smoothing.
Asymmetry of Coordination. [PDF] [BIB]
quick links
   nlp blog
   searn
   nlp/ml meeting
   ml (cs5350/6350)
   thesis
   jmlr
   haskell tutorial
conferences
   aistats 07
   naacl 07
   icml 07
   acl 07
   aaai 07
   sigir 07
   cogsci 07
last updated on three january, two thousand seven; contact me AT hal3 DOT name