PhD Completed 5 Feb 2015
NICTA VRL
Language Technology
Group Dept of Computing and Information Systems
University of Melbourne
Victoria 3010, Australia
Office: Room 8.19, Level 8, Doug McDonell Building (I am no longer here)
Email:
Updates
I just started an exciting new adventure in
Evernote, as a Machine
Learning Software Engineer. As part of the
Augmented Intelligence
team, I am working on improving Evernote, so Evernote can make users smarter.
PhD Thesis
Knowledge discovery and extraction of domain-specific web data
Research Interests
My research interests lie in knowledge discovery and content extraction from
social media data. My current research focuses on improving
information access over troubleshooting-oriented web user forums, by
utilising and combining Machine Learning, Natural Language Processing and
Information Retrieval technologies.
Publications
- Li Wang, Su Nam Kim and Timothy Baldwin (2013) The Utility of
Discourse Structure in Forum Thread Retrieval, In Proceedings of the
Ninth Asian Information Retrieval Societies Conference (AIRS 2013),
Singapore, pp. 284—295. [bib]
- Marco Lui and Li Wang (2013)Recovering
Casing and Punctuation using Conditional Random Fields,
in Proceedings of the Australasian Language Technology Association Workshop 2013 (ALTW 2013),
Brisbane, Australia, pp. 137—141. [bib]
- Timothy Baldwin, Paul Cook, Marco Lui, Andrew MacKinlay and Li Wang (2013)
How Noisy Social Media Text, How Diffrnt
Social Media Sources?, In
Proceedings of the 6th International Joint Conference on Natural
Language Processing (IJCNLP 2013), Nagoya, Japan, pp. 356—364. [bib]
- Li Wang, Su Nam Kim and Timothy Baldwin (2012)
The Utility of
Discourse Structure in Identifying Resolved Threads in Technical User
Forums, In Proceedings of the 24th International Conference on
Computational Linguistics (COLING 2012), Mumbai,
India, pp. 2739—2756. [bib]
-
Li Wang, Diana McCarthy and Timothy Baldwin (2011)
Predicting Thread Linking Structure by Lexical Chaining, In Proceedings of the 2011
Australasian Language Technology Workshop (ALTW 2011), Canberra,
Australia, pp. 76—85.
[slides|bib]
- Li Wang, Marco Lui, Su Nam Kim, Joakim Nivre and Timothy Baldwin (2011)
Predicting
Thread Discourse Structure over Technical Web Forums, In
Proceedings of the 2011 Conference on Empirical Methods in Natural
Language Processing (EMNLP 2011), Edinburgh, UK, pp. 13—25.
[slides|bib|dataset] (Google Plenary Highlight Paper
Award)
- Li Wang, Su Nam Kim and Timothy Baldwin (2010)
Thread-level Analysis over Technical User Forum Data, In
Proceedings of the 2010
Australasian Language Technology Workshop (ALTW 2010), Melbourne,
Australia, pp. 27—31
[slides|bib|dataset]
- Su Nam Kim, Li Wang and Timothy Baldwin (2010) Tagging and Linking Web
Forum Posts, In Proceedings of the Fourteenth Conference on
Computational Natural Language Learning (CoNLL 2010), Uppsala, Sweden,
pp. 192—202.
[bib
|dataset|report]
- Timothy Baldwin, David Martinez, Richard Penman, Su Nam Kim, Marco Lui,
Li Wang and Andrew MacKinlay (2010) Intelligent Linux
Information Access by Data Mining: the ILIAD Project, In
Proceedings
of the NAACL 2010 Workshop on Computational Linguistics in a World of
Social
Media: #SocialMedia, Los Angeles, USA, pp. 15—16.
[bib]
Miscellania