publications
Hal Daumé III
All of my publications are available here. My Erdös number is at most 4 (me to J. Langford to A. Blum to J. Spencer to P. Erdös).
Click here for a listing with summaries.
Thesis
Practical Structured Learning for Natural Language Processing.
Ph.D. Thesis.
[PDF]
[BIB]
[html]
Journal Papers
Search-based Structured Prediction.
Machine Learning Journal (Submitted). (With: John Langford and Daniel Marcu)
[PDF]
[BIB]
[html]
Domain Adaptation for Statistical Classifiers.
Journal of Artificial Intelligence Research (2006). (With: Daniel Marcu)
[PDF]
[BIB]
Induction of Word and Phrase Alignments for Automatic Document Summarization.
Computational Linguistics (2005). (With: Daniel Marcu)
[PDF]
[BIB]
A Bayesian Model for Supervised Clustering with the Dirichlet Process Prior.
Journal of Machine Learning Research (2005). (With: Daniel Marcu)
[PDF]
[BIB]
Conference Papers
Fast search for Dirichlet process mixture models.
Conference on AI and Statistics (2007) -- Preliminary version.
[PDF]
[BIB]
Bayesian Query-Focused Summarization.
Association for Computational Linguistics 2006. (With: Daniel Marcu)
[PDF]
[BIB]
A Large-Scale Exploration of Effective Global Features for a Joint Entity Detection and Tracking Model.
Human Language Technologies/Empirical Methods in NLP 2005. (With: Daniel Marcu)
[PDF]
[BIB]
Learning as Search Optimization: Approximate Large Margin Methods for Structured Prediction.
International Conference on Machine Learning 2005. (With: Daniel Marcu)
[PDF]
[BIB]
A Phrase-Based HMM Approach to Document/Abstract Alignment.
Empirical Methods in NLP 2004. (With: Daniel Marcu)
[PDF]
[BIB]
NP Bracketing by Maximum Entropy Tagging and SVM Reranking.
Empirical Methods in NLP 2004. (With: Daniel Marcu)
[PDF]
[BIB]
Web Search Intent Induction via Automatic Query Reformulation.
North American Association for Computational Linguistics 2004 Short Paper. (With: Eric Brill)
[PDF]
[BIB]
The Importance of Lexicalized Syntax Models for Natural Language Generation Tasks.
International Conference on Natural Language Generation 2002. (With: Kevin Knight, Irene Langkilde-Geary, Daniel Marcu and Kenji Yamada)
[PDF]
[BIB]
A Noisy-Channel Model for Document Compression.
Association for Computational Linguistics 2002. (With: Daniel Marcu)
[PDF]
[BIB]
Integrated Information Management: An Interactive, Extensible Architecture for Information Retrieval.
Human Language Technologies 2001. (With: Eric Nyberg)
[PDF]
[BIB]
Workshop Papers
Search-Based Structured Prediction as Classification.
NIPS 2005 Workshop on Advances in Structured Learning for Text and Speech Processing. (With: John Langford and Daniel Marcu)
[PDF]
[BIB]
Bayesian Summarization and DUC and a Suggestion for Extrinsic Evaluation.
Document Understanding Conference (DUC) 2005. (With: Daniel Marcu)
[PDF]
[BIB]
Bayesian Multi-Document Summarization at MSE.
ACL 2005 Workshop on Multilingual Summarization Evaluation (MSE). (With: Daniel Marcu)
[PDF]
[BIB]
Supervised clustering with the Dirichlet process.
NIPS 2004 Learning With Structured Outputs Workshop. (With: Daniel Marcu)
[PDF]
[BIB]
Generic Sentence Fusion is an Ill-Defined Summarization Task.
Text Summarization Branches Out Workshop (ACL 2004). (With: Daniel Marcu)
[PDF]
[BIB]
A Tree-Position Kernel for Document Compression.
Document Understanding Conference (DUC) 2004. (With: Daniel Marcu)
[PDF]
[BIB]
GLEANS: A Generator of Logical Extracts and Abstracts for Nice Summaries.
Document Understanding Conference (DUC) 2002. (With: Abdesammad Echihabi, Daniel Marcu, Dragos Stefan Munteanu and Radu Soricut)
[BIB]
Book Review
Book Review: Automatic Summarization (by Inderjeet Mani).
Machine Translation.
[PDF]
Unpublished Papers
The following papers are not published anywhere, nor have they been peer reviewed. I put them up because I think (hope!) people might find them useful.
Searn in Practice.
(With: John Langford and Daniel Marcu)
[PDF]
[BIB]
[html]
Carefully Approximated Bayes Factors for Feature Selection in MaxEnt Models.
[PDF]
[BIB]
Notes on CG and LM-BFGS Optimization of Logistic Regression.
[PDF]
[BIB]
Support Vector Machines for Natural Language Processing.
[PDF]
From Zero to Reproducing Kernel Hilbert Spaces in Twelve Pages or Less.
[PDF]
[BIB]
Yet Another Haskell Tutorial.
[PDF]
[BIB]
[html]
A Phrase-Based HMM.
[PDF]
[BIB]
Some notes on binning for Good-Turing smoothing.
Asymmetry of Coordination.
[PDF]
[BIB]
last updated on three january, two thousand seven