Jeffrey Jestes 50 S. Central Campus Drive, School of Computing Salt Lake City, UT, USA 84112 Voice: (801) 648-9790 Fax: (801) 581-5843 E-mail: jestes@cs.utah.edu Webpage: http://www.cs.utah.edu/~jestes
Job Objective: Software Developer, Research Engineer, Data Analyst, Cluster Administrator Education: • Ph.D. in Computer Science, University of Utah, July 2013 (GPA 4.00 / 4.00) - Focus: Data Management and Analysis - Dissertation: Efficient Summarization Techniques for Massive Data • B.S. Computer Science, Florida State University, April 2008 (GPA 4.00 / 4.00) Work/Research Experience:University of Utah Datagroup Cluster Administrator & Big Data Analyst August 2011 - July 2013 - Description: Deployed and administered MapReduce cluster. Lead studies on data analytics on real world BIG data including meteorological and sensor data. • Microsoft Research: eXtreme Computing Group Research Intern May 2012 - August 2012 - Description: Developed automated text classifiers and parsers using machine learning techniques, from web data, improving accuracy over the state of the art up to 30%. • Microsoft Research: eXtreme Computing Group Research Intern June 2011 - September 2011 - Description: Developed light-weight distributed transaction services for cloud-based applications which offer guarantees stronger than eventual consistency. • Florida State University Datagroup Cluster Administrator & Big Data Analyst January 2010 - August 2011 - Description: Deployed and administered MapReduce cluster. Performed data analytics on real world BIG data including IP traces, sensor data, and meteorological data. Programming Languages: - C++ (expert) - Java, C#, C, SQL (proficient) - Perl, Python (prior experience) Programming Platforms: - Linux - Hadoop MapReduce - Microsoft SQL Server Projects: • Data Analytics & Summaries for Massive Data - Skills: C++, Java, Hadoop MapReduce, Perl, TPIE - Description: Developed techniques to accelerate data analytics through summaries considering CPU, I/O, and communication costs resulting in decreased summary construction time and data analytics by more than 10 times. - Project webpage: http://www.cs.utah.edu/~jestes/datasummaries • Ranking and Monitoring Probabilistic Data - Skills: C, C++, Perl, SQL, Microsoft SQL Server - Proposed algorithms to rank and monitor data with uncertainties, such as noise, in both centralized and parallel & distributed settings improving CPU, I/O, and communication more than 10 times over baselines. - Project webpage: http://www.cs.utah.edu/~jestes/distprob Employment Authorization: Citizen of the United States of America Journal Publications: 1. Semantics of Ranking Queries for Probabilistic Data, by J. Jestes, G. Cor- mode, F. Li, K. Yi, Vol. 23, No. 12, pages 1903-1917, IEEE Transactions on Knowledge and Data Engineering (TKDE), 2011. Conference Publications: 1. Quality and Efficiency for Kernel Density Estimates in Large Data, by Y. Zheng, J. Jestes, J. Phillips, F. Li, In Proceedings of 32nd ACM SIGMOD International Conference on Management of Data (SIGMOD 2013 Accep- tance Rate: 20%), pages TBA, NYC, NY, June 2013. 2. Ranking Large Temporal Data, by J. Jestes, J. Phillips, F. Li, M. Tang, In Proceedings of 38th International Conference on Very Large Databases (VLDB 2012 Acceptance Rate: 20.3%), PVLDB 5(11): 1412-1423, Istanbul, Turkey, August, 2012. 3. Building Wavelet Histograms on Large Data in MapReduce, by J. Jestes, K. Yi, F. Li, In Proceedings of 38th International Conference on Very Large Databases (VLDB 2012 Acceptance Rate: 20.3%), PVLDB 5(2): 109-120, Istanbul, Turkey, August, 2012. 4. Efficient Parallel kNN Joins for Large Data in MapReduce, by C. Zhang, F. Li, J. Jestes, In Proceedings of 15th International Conference on Extending Database Technology (EDBT 2012 Acceptance Rate: 22.5%), pages 38-49, Berlin, Germany, March, 2012. 5. Efficient Threshold Monitoring for Distributed Probabilistic Data, by M. Tang, F.Li, J. Phillips, J. Jestes, In Proceedings of 28th IEEE International Conference on Data Engineering (ICDE 2012 Acceptance Rate: 20%), pages 1120-1131, Washington DC. April 2012. 6. Probabilistic String Similarity Joins, by J. Jestes, F. Li, Z. Yan, K. Yi, In Proceedings of 29th ACM SIGMOD International Conference on Manage- ment of Data (SIGMOD 2010 Acceptance Rate: 20.8%), pages 327-338, Indianapolis, Indiana, June 2010. 7. Ranking Distributed Probabilistic Data, by F. Li, K. Yi, J. Jestes, In Pro- ceedings of 28th ACM SIGMOD International Conference on Management of Data (SIGMOD 2009 Acceptance rate: 15.9%), pages 361-374, Providence, Rhode Island, June 2009. Achievements & Awards: • Best Presentation, 2011 Computer Science Research Conference Building Wavelet Histograms on Large Data in MapReduce Florida State University, Tallahassee, FL, USA April 2011 • Full Instructor of Object Oriented Programming in C++ Florida State University, Tallahassee, FL, USA May 2010 - August 2010 • GAANN Fellowship, U.S. Department of Education Florida State University, Tallahassee, FL, USA August 2008 - August 2010 • Member Phi Betta Kappa, Alpha of Florida Florida State University, Tallahassee, FL, USA Fall 2008 • Graduated with Honors, Summa Cum Laude with a 4.00 GPA (out of 4.00) Florida State University, Tallahassee, FL, USA August 2004 - April 2008 • Member National Society of Collegiate Scholars Florida State University, Tallahassee, FL, USA Fall 2006 References: Available upon request.