著作目錄 (Publication List)

Double-Click the Underlined Paper Numbers to Download the Papers
(in PostScript/PDF format, mostly compressed)
All Publications Are Copyrighted.



Publication List (1987--) (Last updated: 2011/12/25)

Conference Papers
[1] Su, K.-Y., J.-S. Chang, and H.-H. Hsu, "A Powerful Language Processing System for English-Chinese Machine Translation," Proc. 1987 Int. Conf. on Chinese and Oriental Language Computing, pp. 260-264, Chinese Language Computer Society (in cooperation with Illinois Institute of Technology and IEEE Computer Society), Illinois Institute of Technology, Chicago, Illinois, U.S.A., June 15-17, 1987.
[2] Su, K.-Y., J.-N. Wang, W.-H. Li and J.-S. Chang, "A New Parsing Strategy in Natural Language Processing Based on the Truncation Algorithm," Proc. of Natl. Computer Symposium (NCS) 1987, vol. 2, pp. 580-586, National Taiwan University, Taipei, R.O.C., Dec 17-18, 1987.
[3] Su, K.-Y. and J.-S. Chang, "Semantic and Syntactic Aspects of Score Function," Proc. COLING-88, 12th Int. Conf. on Comput. Linguistics, vol. 2, pp. 642-644, Int. Committee on Computational Linguistics (ICCL), Budapest, Hungary, 22-27 Aug. 1988.
[4] Chang, J.-S. and K.-Y. Su, "Incremental Environment for Scored Machine Translation Systems," Proc. of ROCLING-I, R.O.C. Computational Linguistics Workshops I (1988), pp. 63-73, Nantou, Taiwan, 21-23 Oct. 1988.
[5] Su, K.-Y., J.-N. Wang, M.-H. Su and J.-S. Chang, "A Sequential Truncation Parsing Algorithm Based on the Score Function," Proc. of Int. Workshop on Parsing Technologies (IWPT-89), pp. 95-104, CMU, Pittsburgh, PA, U.S.A., 28-31 August 1989.
[6] Liu, C.-L., J.-S. Chang and K.-Y. Su, "The Semantic Score Approach to the Disambiguation of PP Attachment Problem," Proceedings of ROCLING-III, pp. 253--270, Taipei, R.O.C., Sept., 1990.
[7] Wu, M.-W., J.-S. Chang, and K.-Y. Su, "The Current Status of ArchTran: A Corpus-Based Statistics-Oriented English-Chinese Machine Translation System," Proceedings of the 1991 Workshop on Machine Translation, pp. 123-138, Nantou, Taiwan, 24-26 June, 1991.
[8] Chen, S.-C., J.-S. Chang, J.-N. Wang and K.-Y. Su, "ArchTran: A Corpus-based Statistics-oriented English-Chinese Machine Translation System," Proceedings of Machine Translation Summit III, pp. 33-40, Washington, D.C., USA, July 1-4, 1991.
[9] Chang, J.-S. and Keh-Yih Su, "A Corpus-Based Approach to Ambiguity Resolution," Proceedings of ISCIPA '91, International Symposium for Chinese Information Processing Application 1991, pp. 43-47, Beijing, China, August 19-21, 1991.
[10] Su, K.-Y. and J.-S. Chang, "Why Corpus-Based Statistics-Oriented Machine Translation," Proceedings of TMI-92, pp. 249--262, 4th Int. Conf. on Theoretical and Methodological Issues in Machine Translation, Montreal, Canada, June 25--27, 1992. (+Slides)
[11] Chang, J.-S., Y.-F. Luo and K.-Y. Su, "GPSM: A Generalized Probabilistic Semantic Model for Ambiguity Resolution," Proceedings of ACL-92, pp. 177--184, 30th Annual Meeting of the Association for Computational Linguistics, University of Delaware, Newark, DE, USA, 28 June--2 July, 1992. (PDF)
[12] Su, K.-Y., M.-W. Wu and J.-S. Chang, "A New Quantitative Quality Measure for Machine Translation Systems," Proceedings of COLING-92, vol. II, pp. 433--439, 14th Int. Conference on Computational Linguistics, Nantes, France, 23--28 July, 1992.
[13] Tung-Hui Chiang, Jing-Shin Chang, Ming-Yu Lin and Keh-Yih Su, "Statistical Models for Word Segmentation and Unknown Word Resolution," Proceedings of ROCLING-V, ROC Computational Linguistics Conference V, pp. 123--146, National Taiwan University, Taipei, Taiwan, ROC, Sep. 18--20, 1992. (PDF version)
[14] Su, K.-Y., Jing-Shin Chang and Y.-C. Lin, "A Discriminative Approach for Ambiguity Resolution Based on a Semantic Score Function," Proceedings ICSLP 92, 1992 International Conference on Spoken Language Processing, vol. 1, pp. 149--152, International Conference Center, Banff, Alberta, Canada, Oct. 12--16, 1992.
[15] Chang, J.-S. and K.-Y. Su, "A Corpus-Based Statistics-Oriented Transfer and Generation Model for Machine Translation," Proceedings of TMI-93, pp. 3--14, 5th Int. Conf. on Theoretical and Methodological Issues in Machine Translation, Kyoto, Japan, July 14--16, 1993.
[16] Wang, Jong-Nae, J.-S. Chang and K.-Y. Su, "An Automatic Treebank Conversion Algorithm for Corpus Sharing," Proceedings of ACL-94, pp. 248-254, 32nd Annual Meeting of the Association for Computational Linguistics, New Mexico State University, Las Cruces, New Mexico, USA, 27 June -- 1 July, 1994.
[17] Su, K.-Y., Ming-Wen Wu and J.-S. Chang, "A Corpus-based Approach to Automatic Compound Extraction," Proceedings of ACL-94, pp. 242--247, 32nd Annual Meeting of the Association for Computational Linguistics, New Mexico State University, Las Cruces, New Mexico, USA, 27 June -- 1 July, 1994.
[18] Chang, J.-S., Y.-C. Lin and K.-Y. Su, "Automatic Construction of a Chinese Electronic Dictionary," Proceedings of the Workshop on Very Large Corpora: Supervised Training vs. Self-organizing Methods (ACL-95), pp. 107--120, 33rd Annual Meeting of the Association for Computational Linguistics, Massachusetts Institute of Technology, Cambridge, MA, USA, June 26--30, 1995. [Slides (+error distribution):PDF | PS]
[19] Su, K.-Y., J.-S. Chang and Y.-L. Una Hsu, "A Corpus-based Two-Way Design for Parameterized MT Systems: Rationale, Architecture and Training Issues," Proceedings of TMI-95, Vol. 2, pp. 334-353, 6th Int. Conf. on Theoretical and Methodological Issues in Machine Translation, University of Leuven, Leuven, Belgium, July 5--7, 1995. (+Slides)
[20] Jing-Shin Chang and Keh-Yih Su, "A Multivariate Gaussian Mixture Model for Automatic Compound Word Extraction," Proceedings of ROCLING-X International Conference 1997, pp. 123-142, Taipei, August 22-24, 1997. (Slides: ps.Z) (Slides: PDF)
[21] Jing-Shin Chang and Keh-Yih Su, "Corpus-Based Statistics-Oriented (CBSO) Machine Translation Researches in Taiwan," Proceedings of MT Summit VI, pp. 165-173, San Diego, CA, USA, 29 Oct - 1 Nov, 1997.
[22] Keh-Yih Su and Jing-Shin Chang, "A Customizable, Self-Learnable Parameterized MT System: The Next Generation," (invited paper) Proceedings of the Machine Translation Summit VII '99, pp. 182-188, Natl. Univ. of Singapore, Singapore, Sept. 13-17, 1999. (+Slides)
[23] Jing-Shin Chang and Yu-Tso Lai, "A Preliminary Study on Probabilistic Models for Chinese Abbreviations," Proceedings of the Third SIGHAN Workshop on Chinese Language Processing, pp. 9-16, ACL-2004, Barcelona, Spain, 25 July, 2004. (PostScript) (Slides)
[24] Jing-Shin Chang, "Mining Domain Specific Words from Web Documents," Proceedings of the 4th China-Japan Joint Conference to Promote Cooperation in Natural Language Processing (CJNLP-04) (abstracts, invited), p. 1, CJNLP-2004, City University of Hong Kong, HK, China, 10-15 Nov., 2004. (Slides: todo)
[25] Jing-Shin Chang, "Domain Specific Word Extraction from Hierarchical Web Documents: A First Step Toward Building Lexicon Trees from Web Corpora," Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, pp. 64-71, IJCNLP-05 (International Joint Conference on Natural Language Processing), Jeju Island, Korea, October 14-15, 2005.
[26] Jing-Shin Chang and Wei-Lun Teng, "Mining Atomic Chinese Abbreviation Pairs: A Probabilistic Model for Single Character Word Recovery," Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, SIGHAN-2006, COLING-ACL 2006 Workshops, pp. 17-24, Sydney, Australia, 22-23 July, 2006. [ISBN 1-932432-70-1]
[27] Jing-Shin Chang and Chun-Kai Kung, "A Chinese-to-Chinese Statistical Machine Translation Model for Mining Synonymous Simplified-Traditional Chinese Terms," Proceedings of Machine Translation Summit XI, pp. 81-88, Copenhagen, Denmark, 10-14 September, 2007. (NSC 95-2221-E-260-033-) [ISBN 978-87-90708-16-0]
[28] Jing-Shin Chang and Sheng-Sian Lin, "Improving Translation Fluency with Search-Based Decoding and a Monolingual Statistical Machine Translation Model for Automatic Post-Editing," Proceedings of the 21st International Conference on Computational Linguistics and Speech Processing, ROCLING-2009, pp. 195-207, National Chung Hsing University, Taichung, Taiwan, ROC, September 1-2, 2009. [ISBN 978-957-30792-2-4]
[29] Jing-Shin Chang, "Hierarchical Web Document Classification Based on Hierarchically Trained Domain Specific Words," Proceedings of the 21st International Conference on Computational Linguistics and Speech Processing, ROCLING-2009, pp. 345-357, National Chung Hsing University, Taichung, Taiwan, ROC, September 1-2, 2009. [ISBN 978-957-30792-2-4]
[30] Jing-Shin Chang and Shih-Jay Chiou, "An EM Algorithm for Context-Based Searching and Disambiguation with Application to Synonym Term Alignment," Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, PACLIC-23, vol. 2, pp. 630-637, City University of Hong Kong, HK, PRC, Dec. 3-5, 2009. (NSC 96-2221-E-260-022-) [ISBN 978-962-442-320-4]
[31] Jing-Shin Chang and Shu-Fan Shih, "An Automatic Query Augmentation Model for Answering Well-Defined Questions in Top-1 Search Result," Proceedings of the 22nd International Conference on Computational Linguistics and Speech Processing, ROCLING-2010, pp. 17-29, National Chi Nan University, Puli, Nantou, Taiwan, ROC, September 1-2, 2010. [ISBN 978-957-30792-3-1]
[32] Yi-Hsuan Chuang, Chao-Lin Liu, and Jing-Shin Chang, "Translating Common English and Chinese Verb-Noun Pairs in Technical Documents with Collocational and Bilingual Information," Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation, PACLIC-25, pp. 493-502, Nanyang Technological University, Singapore, December 16-18, 2011. [ISBN 978-4-905166-02-3]

Journals and Books
[1] Su, K.-Y. and J.-S. Chang, "Some Key Issues in Designing MT Systems," Machine Translation, vol. 5, no. 4, pp. 265--300, Kluwer Academic Publishers, 1990.
[2] Su, K.-Y., J.-N. Wang, M.-H. Su and J.-S. Chang, "GLR Parsing with Scoring," In Tomita, Masaru (ed.), Generalized LR Parsing, Chapter 7, pp. 93--112, Kluwer Academic Publishers, 1991. (PDF version)
[3] Su, K.-Y. and J.-S. Chang, "Why MT Systems Are Still Not Widely Used?" Machine Translation, vol. 7, no. 4, pp. 285--291, Kluwer Academic Publishers, 1993. (earlier version)
[4] Chang, J.-S. and K.-Y. Su, "BehaviorTran: Its Current Status, Prospects and Philosophy," Journal of the Asia-Pacific Association for Machine Translation, No. 7, 1994.
[5] Su, K.-Y., J.-S. Chang, J.-N. Wang, Emma Chang, and M.-W. Wu, "The Computational Models of the BehaviorTran English-Chinese Machine Translation System," in C.-R. Huang, K.-J. Chen and Benjamin K. T'sou (eds.): Journal of Chinese Linguistics, Monograph Series Number 9, Readings in Chinese Natural Language Processing, pp. 223-245. University of California, Berkeley. 1996.
[6] Tung-Hui Chiang, Jing-Shin Chang, Ming-Yu Lin and Keh-Yih Su, "Statistical Word Segmentation," in C.-R. Huang, K.-J. Chen and Benjamin K. T'sou (eds.): Journal of Chinese Linguistics, Monograph Series Number 9, Readings in Chinese Natural Language Processing, pp. 147-173. University of California, Berkeley. 1996. (to check: this PDF is probably not the final version)
[7] K.-Y. Su, Tung-Hui Chiang, and Jing-Shin Chang, "An Overview of Corpus-Based Statistics-Oriented (CBSO) Techniques for Natural Language Processing," International Journal of Computational Linguistics & Chinese Language Processing (CLCLP), vol. 1, no. 1, pp. 101--157, August 1996. (CLCLP archived version: ps)
[8] Yu-Ling Una Hsu, Jing-Shin Chang, and Keh-Yih Su, "Computational Tools and Resources for Linguistics Studies," International Journal of Computational Linguistics & Chinese Language Processing (CLCLP), vol. 2, no. 1, pp. 1-39, 1997.
[9] Jing-Shin Chang and Keh-Yih Su, "An Unsupervised Iterative Method for Chinese New Lexicon Extraction," International Journal of Computational Linguistics and Chinese Language Processing (CLCLP), vol. 2, no. 2, pp. 97-148, August, 1997.
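[10] Chu-Ren Huang, Keh-Yih Su, and Jing-Shin Chang, "Prospects: The Development and Impact of Machine Translation" (展望:機器翻譯之研發與衝擊), Academia Sinica, pp. 1-29, 1997. (in Chinese)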
[11] Jing-Shin Chang and Wei-Lun Teng, "Mining Atomic Chinese Abbreviations with a Probabilistic Single Character Recovery Model," Language Resources and Evaluation, 40:367-374, DOI 10.1007/s10579-007-9026-8, Springer-Verlag, 2006. (Published online: 18 July 2007, Accepted: 14 May 2007) (SCIE) (NSC 93-2213-E-260-015-)

Master Thesis
(* A very early work, possibly the first, on lexicalized probabilistic parsing *)
(* with context sensitive & structure normalization models *)
(* A successor to the COLING-88 paper. *)
Jing-Shin Chang, GPSM: A Generalized Probabilistic Semantic Model for Ambiguity Resolution (解決岐義問題之一般化語義機率模式), Master thesis, Institute of Computer Science and Information Engineering, National Chiao-Tung University, Hsinchu, Taiwan, R.O.C., July, 1990.

Ph.D. Dissertation
(* An early work on PR maximization problems *)
Jing-Shin Chang, Automatic Lexicon Acquisition and Precision-Recall Maximization for Untagged Text Corpora (詞彙自動抽取及最佳化技術之研究), Ph.D. dissertation, Department of Electrical Engineering, National Tsing-Hua University, Hsinchu, Taiwan, R.O.C., July, 1997.
- All-in-one (.tar.gz with compressed files, including PS Slides) [about 2M]
- Individual parts of the All-in-one package:
  - CopyRights & README
  - Cover Sheets (Chinese & English)
  - Abstract (Chinese), Abstract (English), Acknowledgements
  - Table of Contents
  - List of Figures
  - List of Tables
  - Contents [compressed (.Z) | uncompressed]
  - Appendices
  - References
  - Slides (ps.Z)
  - Slides (PDF)

Lectures/Tutorial Courses/Invited Talks
- [1] Introduction to Corpus-based Statistics-oriented (CBSO) Techniques
  - Su, K.-Y., T.-H. Chiang and J.-S. Chang, "Introduction to Corpus-based Statistics-oriented (CBSO) Techniques," Pre-Conference Workshop on Corpus-based NLP, ROC Computational Linguistics Conference VII, National Tsing-Hua Univ., Taiwan, ROC, Aug. 1994.
  - Part I: Introduction (PDF/4) (PS/4) (PDF) (PS)
  - Part II: Basic Concepts (PDF/4) (PS/4) (PDF) (PS)
  - Part III: Techniques (PDF/4) (PS/4) (PDF) (PS)
  - Errata: Corrections to Part I-III (TXT)
- [2] Unsupervised Learning for Natural Language Processing
  - ROCLING/ACLCLP, Academia Sinica, Taipei, Taiwan, ROC, 12/10/1999.
  - Co-lecture with Dr. Keh-Yih Su.
  - Workshop Slides: in PostScript (1x1, 2x1), PDF (1x1, 2x1)
- [3] Lecture for Statistical Natural Language Processing
  - an Open Tutorial for Academic Institutes
  - Microsoft Research Asia, Beijing, PRC, Aug 17-18, 2002
  - Co-lecture with Dr. Keh-Yih Su.
  - Lecture Slides: in compressed PDF (.PDF.zip) (Some minor typos are known in the current version. Any correction is appreciated.)
- [4] Automatic Lexicon Acquisition (with Precision-Recall Maximization Techniques)
  - SWCL-2002 (1st Student Workshop on Computational Linguistics), Peking University, Beijing, PRC, Aug 21, 2002
  - SWCL-2002 Workshop Slides
- [5] Pitfalls in Applying Unsupervised Learning to NLP
  - Pre-Conference Tutorial, IJCNLP-04, Mar 21, 2004
  - IJCNLP-04, Sanya, Hainan, PRC, Mar 22-24, 2004
  - Jing-Shin Chang and Keh-Yih Su.
  - Tutorial Slides: in compressed PDF (.PDF.zip)
- [6] Mining Domain Specific Words from Web Documents
  - CJNLP-04, 10-15 Nov, 2004
  - City University of Hong Kong, HK, PRC, 10-15 Nov, 2004
  - Jing-Shin Chang
  - (todo)
- [7] Statistical Models for Chinese Abbreviations: Abbreviation Generation, Recovery and Lexicon Acquisition
  - CJNLP-06, 13-15 Nov, 2006
  - Shanghai Jiao Tong University, Shanghai, PRC
  - Jing-Shin Chang
  - Slides (CD version)

Patents
Su, Keh-Yih, Jing-Shin Chang, Jong-Nae Wang and Mei-Hui Su, Multiple Score Language Processing System, US Patent Number: 5,418,717, Date of Patent: May 23, 1995, Filed: Dec. 12, 1991 (CIP: Aug. 27, 1990), Valid Dates: ~2012/5/23 (the later of filing date + 20 years and issue date + 17 years).
Su, Keh-Yih, Jing-Shin Chang, Jong-Nae Wang and Mei-Hui Su, Language Processing System Based on Multiple Score Functions (以多重評分函數為基礎的語言處理系統), ROC Patent Number: Invention No. 63946 (發明第 63946 號), Valid Dates: 1993/10/01-2008/09/30 (15 years).
Su, Keh-Yih, Jing-Shin Chang, Jong-Nae Wang, and Mei-Hui Su, Language Processing System Based on Multiple Score Functions (以多重評分函數為基礎的語言處理系統), PRC Patent Number (專利號): ZL 92 1 14377.X, International Patent Main Classification (國際專利主分類號): G06F 17/28, Certificate No. 65974 (證書號第65974號), Date Patented: 2000/12/15, Valid Dates: 1992/12/10-2007/12/10 (15 years).

Theses of Students
[1] Lai, Yu-Tso (賴育佐), 2003. "A Probabilistic Model for Chinese Abbreviations", Master Thesis, Department of Computer Science and Information Engineering, National Chi Nan University, Taiwan, ROC, July, 2003. ("中文縮寫詞之機率統計模式")
[2] Huang, Ho-Yu (黃厚諭), 2003. "Mining Bilingual Translation Pairs from the Web", Master Thesis, Department of Computer Science and Information Engineering, National Chi Nan University, Taiwan, ROC, July, 2003. ("全球資訊網雙語對譯文件搜尋模式")
[3] Teng, Wei-Lun (鄧偉倫), 2006. "Automatic Models for Mining Atomic Chinese Abbreviations", Master Thesis, Department of Computer Science and Information Engineering, National Chi Nan University, Taiwan, ROC, July, 2006.
[4] Kung, Chun-Kai (龔俊愷), 2007. "Aligning Simplified-Traditional Chinese Vocabularies with a Chinese-to-Chinese Statistical Machine Translation Model", Master Thesis, Department of Computer Science and Information Engineering, National Chi Nan University, Taiwan, ROC, July, 2007.
[5] Lin, Sheng-Sian (林勝賢), 2007. "Improving Translation Fluency with a Monolingual Statistical Machine Translation Model", Master Thesis, Department of Computer Science and Information Engineering, National Chi Nan University, Taiwan, ROC, July, 2007.
[6] Shih, Shu-Fan (施書帆), 2007. "A Query Augmentation Model for Answering Well-Defined Questions", Master Thesis, Department of Computer Science and Information Engineering, National Chi Nan University, Taiwan, ROC, July, 2007.
[7] Chiou, Shi-Jay (邱世杰), 2009. "A Statistical Context-Based Searching Model", Master Thesis, Department of Computer Science and Information Engineering, National Chi Nan University, Taiwan, ROC, July, 2009.

Freeware
Jing-Shin Chang, "BeTTY: An Online Code Converter for Chinese Terminals of the Big5 Encoding Family plus Offline Code Converters for Traditional and Simplified Chinese Codes", 1993-1995. Available at: ftp://ftp.ifcss.org/pub/software/unix/convert/BeTTY-1.534.tar.gz and ftp://nctuccca.edu.tw/Chinese/ifcss/software/unix/convert/BeTTY-1.534.tar.gz; a local copy is available as BeTTY-1.534.tar.gz or BeTTY-1.534.tar.Z. (Patched features by wdong@umist.ac.uk were also available in the above directories.)

Miscellaneous
Jing-Shin Chang, "ROCLING Text Corpus Exchange Formats," for SIGMT, Special Interest Group on Machine Translation, ROC Computational Linguistics Society, May, 1994.
"RFC 1922: Chinese Character Encoding for Internet Messages," (2nd Rev.), March, 1995. Invited as a co-author of this RFC document for technical revisions to the ROC part of the ISO-2022 conformant encoding standard, aka ISO-2022-CN (originating from the Chinese Character Subworking Group of the I18N/L10N Working Group of the Asian Pacific Networking Group, APNG-CC). (Reports related to RFC 1922)
Jing-Shin Chang, "CJK DOCP Recommendation: CJK multilingual linguistic markup -- CJKDOCP Text Corpus Exchange Formats," aka CJK-DOCP/97-9, ISO/IEC JTC1/SC18/WG8, CJK DOCP (CJK Document Processing) Meeting, 10th CJK DOCP Meeting, Taipei, Taiwan, ROC, Feb. 20-21, 1997. (Draft dated Dec 6, 1995.)