Journal Publications

  1. Xingquan Zhu, Bin Li, Xindong Wu, Dan He and Chengqi Zhang: CLAP: Collaborative Pattern Mining for Distributed Information Systems, Decision Support Systems, 2011, accepted.

  2. Dan He, Noah Zaitlen, Bogdan Pasaniuc, Eleazar Eskin, Eran Halperin: Genotyping common and rare variation using overlapping pool sequencing, BMC Bioinformatics, 2011, to appear.

  3. Dan He, Farhad Hormozdiari, Nick Furlotte, Eleazar Eskin: Efficient Algorithms for Tandem Copy Number Variation Reconstruction in Repeat-rich Regions, Bioinformatics, 2011, to appear.

  4. Jae Hoon Sul, Buhm Han, Dan He, Eleazar Eskin: An Optimal Weighted Aggregated Association Test for Identification of Rare Variants Involved in Common Diseases, Genetics, 2010, to appear.

  5. Dan He, Nick Furlotte, Eleazar Eskin: Detection and reconstruction of copy number variations, BMC Bioinformatics, 2010, to appear.

  6. Dan He, Xindong Wu, Xingquan Zhu: Approximate Repeating Pattern Mining with Gap Requirements, The Journal of Computational Intelligence, 2010, to appear.

  7. Dan He, Arthur Choi, Knot Pipatsrisawat, Adnan Darwiche and Eleazar Eskin: Optimal Algorithms for Haplotype Assembly From Whole-Genome Sequence Data, Bioinformatics, to appear.

  8. Dan He, Abdullah N. Aslan, Alan C.H. Ling: A fast Algorithm for the Constrained Multiple Sequence Alignment problem, Acta Cybernetica, 2006, accepted.

  9. Dan He, Abdullah Aslan: A space-efficient algorithm for the constrained pairwise sequence alignment problem, Genome Informatics 2005, Genome Informatics Vol. 16, No. 1. ISBN 4-946443-93-2. Universal Academy Press, Inc.

Conference Publications

  1. Dan He, Buhm Han, Eleazar Eskin: Optimal Algorithm for Haplotype Phasing with Imputation using Sequencing Data, the 16th Annual International Conference on Research in Computational Molecular Biology (Recomb 2012), April 21-24, Barcelona, Spain, 2012.

  2. Dan He, Xingquan Zhu, Douglas S. Parker: How Does Research Evolve? Pattern Mining for Research Meme Cycles, the 2011 IEEE International Conference on Data Mining (ICDM2011), Dec. 11-14, Vancouver, Canada, 2011.

  3. Dan He: Mining Research Topic-related Influence between Academia and Industry, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD2011) (Acceptance rate: 20% out of 599 submissions), Athens, Greece, Sep 5-9, 2011.

  4. Dan He, Pratima Kunwar, Helen Horton, Eleazar Eskin, Peter Gilbert, Tomer Hertz: Using HLA binding prediction algorithms for epitope mapping in HIV vaccine clinical trials, Second Immunoinformatics and Computational Immunology Workshop (ICIW 2011), Aug 1-3, 2011, Chicago.

  5. Dan He, Farhad Hormozdiari, Nick Furlotte, Eleazar Eskin: Efficient Algorithms for Tandem Copy Number Variation Reconstruction in Repeat-rich Regions, HiTSeq 2011 (Joint with ISMB 2011), July 15-19, 2011, Vienna, Austria.

  6. Dan He, Noah Zaitlen, Bogdan Pasaniuc, Eleazar Eskin, Eran Halperin: Genotyping common and rare variation using overlapping pool sequencing, RECOMB Satellite Workshop on Massively Parallel Sequencing (Recomb 2011), March 26-27, 2011, Vancouver, BC, Canada.

  7. Dan He: Mining Research Cycles with Adapted Hierarchical Clustering, Text Mining workshop of the Eleventh SIAM International Conference on Data Mining (SDM 2011), Mesa, Arizona, April 30, 2011.

  8. Dan He: Learning the Funding Momentum of Research Projects, The 15th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2011) (Acceptance for long presentation: 9.7%), May 24-27, 2011, Shenzhen, China.

  9. Pratima Kunwar, Dan He, Ann Collier, Tomer Hertz and Helen Horton: Analysis of epitope-specific HIV T cell Responses during early HIV Infection and their association with viral control, Keystone Symposia: Protection from HIV: Targeted Intervention Strategies (Poster, selected for oral presentation and travel scholarship), Mar 20-25, 2011, Whistler, British Columbia.

  10. Dan He, Nick Furlotte, Eleazar Eskin: Efficient Algorithm for Reconstruction of Tandemly organized copy number variations in repeat-rich regions, The 60th Annual Meeting of the American Society of Human Genetics (ASHG 2010) (Poster), Washington DC, Nov. 2-6, 2010.

  11. Michael Welch, Uri Schonfeld, Dan He, Junghoo Cho: Topical Semantics of Twitter Links, Fourth ACM International Conference on Web Search and Data Mining (WSDM 2011) (Acceptance rate: 32 (8.6%) + 51 (13.7%) out of 372 submissions), Hong Kong, China, February 9-12, 2011.

  12. Dan He, Nick Furlotte, Eleazar Eskin: Detection and reconstruction of copy number variations, The 21st International Conference on Genome Informatics (GIW 2010), Hangzhou, China, December 16-18, 2010.

  13. Dan He, Eleazar Eskin: Effective Algorithms for Fusion Gene Detection, 10th Workshop on Algorithms in Bioinformatics (WABI 2010), September 6-8, 2010, University of Liverpool, United Kingdom.

  14. Dan He, Douglas S. Parker: Topic Dynamics: an alternative model of 'Bursts' in Streams of Topics, The 16th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD 2010) (Acceptance rate: 13% out of 578 submissions), July 25-28, 2010, Washington DC.

  15. Dan He, Arthur Choi, Knot Pipatsrisawat, Adnan Darwiche and Eleazar Eskin: Optimal Algorithms for Haplotype Assembly From Whole-Genome Sequence Data, The 18th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB 2010) (Acceptance rate: 19% out of over 240 submissions), July 11-13, 2010, Boston.

  16. Dan He, Xindong Wu, Xingquan Zhu: Rule Synthesizing from Multiple Related Databases, The 14th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2010) (Acceptance rate: 10.2% out of 412 submissions), June 21-24, 2010, Hyderabad, India.

  17. Dan He, Xingquan Zhu, Xindong Wu: Approximate Repeating Pattern Mining with Gap Requirements, 21st IEEE Int'l Conference on Tools with Artificial Intelligence (ICTAI 2009) (one of the 8 best-paper finalists out of 205 submissions), Newark, New Jersey, Nov. 2-4, 2009.

  18. Dan He, Xingquan Zhu, Xindong Wu: Error Detection and Uncertainty Modeling for Imprecise Data, 21st IEEE Int'l Conference on Tools with Artificial Intelligence (ICTAI 2009) (short paper), Newark, New Jersey, Nov. 2-4, 2009.

  19. Nick Furlotte, Dan He, Eleazar Eskin: Detection and reconstruction of copy number variations, The 59th Annual Meeting of the American Society of Human Genetics (ASHG 2009) (Poster), Honolulu, Hawaii, Oct. 20-24, 2009.

  20. Dan He, Eleazar Eskin: Optimal Algorithm for Haplotype Assembly from Whole-Genome Sequence Data, Proceedings of the 13th Annual International Conference on Research in Computational Molecular Biology (RECOMB 2009) (Poster), Tucson, Arizona, May 18-21, 2009.

  21. Xingquan Zhu, Peng Zhang, Xindong Wu, Dan He, Chengqi Zhang, and Yong Shi: Cleansing Noisy Data Streams, Proceedings of the IEEE International Conference on Data Mining (ICDM 2008), Pisa, Italy, December 15-19, 2008.

  22. Dan He, Abdullah N. Arslan, Yu He and Xindong Wu: Iterative Refinement of Repeat Sequence Specification Using Constrained Pattern Matching, Proceedings of the IEEE 7th International Symposium on Bioinformatics & Bioengineering (BIBE 2007), Harvard Medical School Conference Center, Cambridge - Boston, Massachusetts, USA, October 14-17, 2007.

  23. Dan He, Xindong Wu and Xingquan Zhu: SAIL-APPROX: An Efficient On-line Algorithm for Approximate Pattern Matching with Wildcards and Length Constraints, Proceedings of the 2007 IEEE International Conference on Bioinformatics and Biomedicine (BIBM'07) (acceptance rate: 60/133), San Jose, CA, USA, November 2-4, 2007.

  24. Dan He: BMA*: an efficient algorithm for one-to-some shortest paths problem on road maps, Proceedings of the 3rd International Conference on Algorithmic Aspects in Information and Management, AAIM'07, Lecture Notes in Computer Science, June 6-8, 2007, Portland, USA

  25. Dan He: A Novel Greedy Algorithm for the Minimum Common String Partition Problem, Proceedings of the 2007 International Symposium on Bioinformatics Research and Applications, ISBRA 2007, Lecture Notes in Computer Science, May 7-10, 2007, Atlanta, Georgia, USA

  26. Dan He, Xindong Wu: An Efficient Algorithm for Finding Approximate Complex Repetitive Patterns, Proceedings of the International Conference on Computational and Systems Biology, CASB 2006, November 13-15, 2006, Dallas, Texas, USA

  27. Abdullah Aslan, Dan He: An Improved Algorithm for the regular expression constrained multiple sequence alignment problem, Proceedings of the 6th IEEE Symposium on Bioinformatics and Bioengineering, BIBE 2006, Oct 16-18, 2006, Washington DC, USA

  28. Dan He, Xindong Wu: Ontology-Based Feature Weighting for Biomedical Literature Classification, Proceedings of the 2006 IEEE International Conference on Information Reuse and Integration, IEEE IRI 2006, Sep 16-18, 2006, Waikoloa, Hawaii, USA

  29. Dan He: Using Suffix Tree to Discover Complex Repetitive Patterns in DNA Sequences, The 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE EMBC 2006, New York City, New York, USA, August 30 - September 3, 2006

  30. Dan He, Abdullah Aslan: Space-efficient Parallel Algorithms for the Constrained Multiple Sequence Alignment Problem, The 2006 International Conference on Bioinformatics & Computational Biology, BIOCOMP 2006, Las Vegas, Nevada, USA, June 26-29, 2006 (Acceptance rate: 46 out of 141)

  31. Dan He, Abdullah Aslan: A* Algorithms for the Constrained Multiple Sequence Alignment Problem, The 2006 International Conference on Artificial Intelligence, ICAI 2006, Las Vegas, Nevada, USA, June 26-29, 2006 (Acceptance rate: 73 + 32 out of 230)

  32. Dan He, Abdullah Aslan: FastPCMSA: An improved parallel algorithm for the constrained multiple sequence alignment problem, The 2006 International Conference on Foundations of Computer Science, FCS 2006, Las Vegas, Nevada, USA, June 26-29, 2006 (Acceptance rate: 31 out of 83)

  33. Dan He, Abdullah Aslan: A space-efficient algorithm for the constrained pairwise sequence alignment problem, The 16th International Conference on Genome Informatics, GIW 2005, PACIFICO YOKOHAMA, Japan, December 19-21, 2005 (Acceptance rate: 26 out of around 60)

  34. Dan He, Abdullah Aslan: A parallel algorithm for the constrained multiple sequence alignment problem, the 5th IEEE Symposium on Bioinformatics and Bioengineering, BIBE 2005, Minneapolis, Minnesota, October 19-21, 2005 (Acceptance rate: 29 + 18 out of 120)

  35. Dan He, Abdullah Aslan: A fast algorithm for the constrained multiple sequence problem, 11th International Conference on Automata and Formal Languages, AFL 2005, Dobogoko, Hungary, May 17-20, 2005 (Acceptance rate: 21 out of 37)

Technical Reports

Conference Presentations

  1. Dan He: Using Suffix Tree to Discover Complex Repetitive Patterns in DNA Sequences, The 28th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE EMBC 2006, New York City, New York, USA, August 30 - September 3, 2006

  2. Dan He: Space-efficient Parallel Algorithms for the Constrained Multiple Sequence Alignment Problem, The 2006 International Conference on Bioinformatics & Computational Biology, BIOCOMP 2006, Las Vegas, Nevada, USA, June 26-29, 2006

  3. Dan He: A* Algorithms for the Constrained Multiple Sequence Alignment Problem, The 2006 International Conference on Artificial Intelligence, ICAI 2006, Las Vegas, Nevada, USA, June 26-29, 2006

  4. Dan He: FastPCMSA: An improved parallel algorithm for the constrained multiple sequence alignment problem, The 2006 International Conference on Foundations of Computer Science, FCS 2006, Las Vegas, Nevada, USA, June 26-29, 2006

  5. Dan He: A space-efficient algorithm for the constrained pairwise sequence alignment problem, The 16th International Conference on Genome Informatics, GIW 2005, PACIFICO YOKOHAMA, Japan, December 19-21, 2005