首页--语言、文字论文--语言学论文--应用语言学论文

Looking for Better Chinese Indexes: A Corpus-based Approach to Base NP Detection and Indexing

Chapter 1. Introduction第14-22页
    1.1. Thesis Objectives第17-18页
    1.2. Thesis Structure第18-22页
Chapter 2. Basics of Indexing第22-37页
    2.1. Information Retrieval Systems第22-24页
    2.2. Content Representation and Descriptors第24-28页
        2.2.1. Term Exhaustivity and Specificity第24-25页
        2.2.2. Single and Complex Terms第25-26页
        2.2.3. Term Weighting Factors第26-27页
        2.2.4. Stop Words第27-28页
    2.3. Query第28-29页
    2.4. Relevance Judgement and Evaluation第29-32页
    2.5. Retrieval Models第32-33页
    2.6. Vector Space Model第33-35页
        2.6.1. Query and Document Similarity第34-35页
        2.6.2. Weighting Terms in VSM第35页
    2.7. Summary第35-37页
Chapter 3. Chinese Indexing Basics第37-46页
    3.1. Chinese Characters,N-grams and Words第38-39页
    3.2. Character-based Indexing第39-40页
    3.3. N-gram-based Indexing第40-42页
    3.4. Word-based Indexing第42-45页
        3.4.1. Word Segmentation and Indexing第42-44页
        3.4.2. Word-based Indexing Procedures第44-45页
    3.5. Summary:Comparing Single Term Indexing Methods第45-46页
Chapter 4. Phrase Indexing:Why,What and How第46-76页
    4.1. The Need for Complex Term Indexing第46-50页
        4.1.1. The Discrimination Model第46-48页
        4.1.2. DisV,df, Specificity and Indexing Quality第48-50页
    4.2. The Need for Chinese Phrase Indexing第50-56页
        4.2.1. Index Numbers and Frequencies第50-51页
        4.2.2. Increase Rate of Indexes第51-53页
        4.2.3. Document Frequencies of Chinese Word Indexes第53-55页
        4.2.4. Section Summary第55-56页
    4.3. Phrases as Complex Indexes:Related Work第56-71页
        4.3.1. Defining Phrase第56-57页
        4.3.2. Traditional Chinese Phrase Studies第57-59页
        4.3.3. Ways of Extracting Phrases第59-65页
        4.3.4. Phrase Representation Approaches第65-67页
        4.3.5. Phrase Weighting and Similarity Calculation第67-69页
        4.3.6. Effectiveness of Phrase Indexing第69-71页
    4.4. The State of Art of Chinese Phrase Indexing第71-73页
    4.5. Summary:What Affects Phrase Indexing Effectiveness?第73-76页
Chapter 5. BaseNP Notion and Detection:A Corpus-based Approach第76-119页
    5.1. Chinese Base Noun Phrases第76-92页
        5.1.1. Defining BaseNP第77-80页
        5.1.2. Relationship of BaseNP Components and Structures第80-83页
        5.1.3. Transforming a BaseNP into a Uniform Structure第83-87页
        5.1.4. BaseNP Templates第87-89页
        5.1.5. BaseNP Words and Non-baseNP Words第89-91页
        5.1.6. Section Summary:A Top-down View on BaseNP第91-92页
    5.2. The Corpus-based Approach to Language Processing第92-94页
    5.3. BaseNP Forming Ability Hypotheses第94-99页
        5.3.1. the Hypotheses第94-95页
        5.3.2. Defining the Abilities第95-99页
    5.4. BaseNP Detection:Algorithms第99-119页
        5.4.1. BaseNP Detection Methods:an Overview第99-103页
        5.4.2. Learning and Measuring BaseNP Forming Abilities第103-108页
        5.4.3. Applying What Is Learned第108-116页
        5.4.4. Section Summary第116-119页
Chapter 6. BaseNP Detection:Empirical Studies and Experiments第119-152页
    6.1. Objectives and Designs第119-123页
        6.1.1. Experiment Designs第120-121页
        6.1.2. Evaluation Considerations第121-122页
        6.1.3. General Procedures for BaseNP Detection Experimentation第122-123页
    6.2. BaseNP Detection Experimental Environment第123-127页
        6.2.1. The Raw Corpus第123-124页
        6.2.2. BaseNP Marking Procedures and Guidelines第124-126页
        6.2.3. Statistics about the Marked Corpus第126页
        6.2.4. The Tag Set and Dictionary第126-127页
        6.2.5. The BaseNP Detection Module第127页
    6.3. Knowledge Bases Acquisition第127-132页
        6.3.1. An Overview of the Three Knowledge Bases第128页
        6.3.2. Individual Differences among Words in Forming BaseNPs第128-129页
        6.3.3. Individual Differences among Tags in Forming baseNPs第129-130页
        6.3.4. Individual Differences among Templates in Forming BaseNPs第130-131页
        6.3.5. Summary of the Learning Results第131-132页
    6.4. Experiment 1:Boundary-based Detection第132-140页
        6.4.1. Algorithm and Procedures第132-133页
        6.4.2. Factors to be Tested and Experiment Designs第133-134页
        6.4.3. Results of Experiment 1第134-140页
        6.4.4. Summary of Experiment 1第140页
    6.5. Experiment 2:Template-based Detection第140-144页
        6.5.1. Procedures and experiment designs第140-141页
        6.5.2. Results of Experiment 2第141-144页
        6.5.3. Summary of Experiment 2第144页
    6.6. Experiment 3:Hybrid Detection Methods第144-150页
        6.6.1. Two Objectives of Experiment 3第145页
        6.6.2. Algorithms and Procedures第145-146页
        6.6.3. Factors to be tested and experiment designs第146-147页
        6.6.4. Results of Experiment 3-1第147-149页
        6.6.5. Results of Experiment 3-2第149-150页
        6.6.6. Summary of Experiment 3第150页
    6.7. Summary:Comparing baseNP Detection Results第150-152页
Chapter 7. Chinese Complex Term Indexing with BaseNP第152-173页
    7.1. Procedures for Retrieval Experimentation第152-153页
    7.2. BaseNP Indexing Method第153-156页
        7.2.1. Overview第153-154页
        7.2.2. BaseNP Representations and Document Vectors第154-155页
        7.2.3. BaseNP Indexing Procedures第155-156页
        7.2.4. Query Processing and Representation第156页
        7.2.5. Weighting Functions and Similarity Calculation第156页
    7.3. Indexing Experimental Environment第156-162页
        7.3.1. Experimental System-CEIRS第157-159页
        7.3.2. Experimental Document and Query Collections第159-162页
    7.4. Indexing Experiments:Objectives and Designs第162-163页
    7.5. Indexing Experimental Results and Analysis第163-169页
        7.5.1. Overall Retrieval Results第163-164页
        7.5.2. Effectiveness of the BaseNP Indexing Method第164-167页
        7.5.3. Effects of baseNP Normalization第167-168页
        7.5.4. Effects of Query Length第168-169页
    7.6. Summary and Further Analysis of Retrieval Results第169-173页
Chapter 8. Conclusions and Future Work第173-180页
    8.1. General Conclusions and Implications第173-177页
    8.2. Using BaseNP:Efficiency and Other Considerations第177-178页
    8.3. Future Work第178-180页
Appendix第180-189页
Bibliography第189-197页

论文共197页,点击 下载论文
上一篇:云贵高原人工草地推荐施肥研究
下一篇:基于电流模技术的数控集成电路设计与研究