| Acknowledgements | 第1-5页 |
| 摘要 | 第5-8页 |
| Abstract | 第8-13页 |
| Contents | 第13-19页 |
| List of Tables | 第19-21页 |
| List of Figures | 第21-22页 |
| List of Acronyms and Abbreviations | 第22-23页 |
| Chapter 1 Introduction | 第23-51页 |
| ·Noticing the Significance | 第24-39页 |
| ·Test Scores and Language Related Research | 第24-27页 |
| ·Test Scores and Decision Making in Educational Programs | 第27-36页 |
| ·Selection and Test Scores for Selection Decisions | 第28-30页 |
| ·Placement and Test Scores for Placement Decisions | 第30-32页 |
| ·Diagnosis and Test Scores for Diagnostic Decisions | 第32-33页 |
| ·Test Scores and Program Evaluation | 第33-35页 |
| ·Minimum Competence and Test Scores in Minimum Competence Decisions | 第35-36页 |
| ·Test Scores and the Reliability and Uncertainty of Test Results | 第36-39页 |
| ·Identifying the Object for Research | 第39-47页 |
| ·Identifying Some Theoretical Problems | 第40-45页 |
| ·The Important and the Neglected | 第40-44页 |
| ·The Conflicts between Theories | 第44-45页 |
| ·Identifying Some Practical Needs | 第45-47页 |
| ·The Bleak Picture of Testing Practice in China | 第45-46页 |
| ·The Hopeful Future in China’s Testing Practice | 第46-47页 |
| ·Overview of the Dissertation | 第47-50页 |
| ·Purpose and Score of the Study | 第47-48页 |
| ·Study Questions | 第48页 |
| ·Overview of the Dissertation | 第48-50页 |
| ·Summary | 第50-51页 |
| Chapter 2 Types of Language Tests | 第51-91页 |
| ·Language Tests: Norm-Referenced and Criterion-Referenced | 第51-63页 |
| ·Norm-Referenced Tests | 第52-56页 |
| ·The Origin and Types of Norm-Referencing | 第52-54页 |
| ·The Distinctive Features of a Norm-Referenced Test | 第54-55页 |
| ·Purposes and Scores for Norm-Referencing | 第55-56页 |
| ·Criterion-Referenced Tests | 第56-63页 |
| ·The Origin and Types of Criterion-Referencing | 第56-59页 |
| ·The Distinctive Features of a Criterion-Referenced Test | 第59-60页 |
| ·Purposes of and Scores for Criterion-Referencing | 第60-63页 |
| ·Language Tests: Power and Speed | 第63-66页 |
| ·Power Tests | 第63-65页 |
| ·Definition and Design Features | 第63-64页 |
| ·Purpose and Score for a Power Test | 第64-65页 |
| ·Speed Tests | 第65-66页 |
| ·Definition and Design Features | 第65-66页 |
| ·Purpose of and Score for a Speed Test | 第66页 |
| ·Language Tests: Mental Power and Mental Work | 第66-71页 |
| ·Tests of Mental Power | 第67-68页 |
| ·Definition and Design Features | 第67-68页 |
| ·Purpose of and Score for a Test of Mental Power | 第68页 |
| ·Tests of Mental Work | 第68-71页 |
| ·Definition and Design Features | 第68-69页 |
| ·Purpose of and Score for a Test of Mental Work | 第69-71页 |
| ·Language Tests; Extensive and Intensive | 第71-77页 |
| ·Extensive Tests | 第71-74页 |
| ·Definition and Design Features | 第71-73页 |
| ·Purpose of and Score for a Test of Extensive Quantity | 第73-74页 |
| ·Intensive Tests | 第74-77页 |
| ·Definition and Design Features | 第74-75页 |
| ·Purpose of and Score for a Test of Intensive Quantity | 第75-77页 |
| ·Language Tests: Weakness Based and Strength Based | 第77-80页 |
| ·Definition and Design Features of the Weakness Based Tests | 第78-79页 |
| ·Purposes of and Score for a Weakness Based Test | 第79-80页 |
| ·Language Tests: Nominal, Ordinal, Interval, and Ratio | 第80-90页 |
| ·Tests at the Nominal Level of Measurement | 第82-83页 |
| ·Definition | 第82-83页 |
| ·Property of the Scale, Statistics Allowed and Common Mistakes or Misbelieves | 第83页 |
| ·Tests at the Ordinal Level of Measurement | 第83-85页 |
| ·Definition | 第83-84页 |
| ·Property of the Scale, Statistics Allowed and Common Mistakes or Misbelieves | 第84-85页 |
| ·Tests at the Interval Level of Measurement | 第85-87页 |
| ·Definition | 第85页 |
| ·Property of the Scale, Statistics Allowed and Common Mistakes or Misbelieves | 第85-87页 |
| ·T ests at the Ratio Level of Measurement | 第87-90页 |
| ·Definition | 第87-88页 |
| ·Property of the Scale, Statistics Allowed and Common Mistakes or Misbelieves | 第88-90页 |
| ·Summary | 第90-91页 |
| Chapter 3 The Derivation of Scores for Language Tests | 第91-157页 |
| ·Scale, Scaling, Score and Scoring | 第92-99页 |
| ·Scale | 第92-94页 |
| ·Scaling | 第94-96页 |
| ·Score | 第96-97页 |
| ·Scoring | 第97-99页 |
| ·Some Frequently Used Score Scales: a Critical Review | 第99-135页 |
| ·The Raw Score Scale | 第100-108页 |
| ·Definition and Illustration | 第100-103页 |
| ·Application(s) | 第103-106页 |
| ·Evaluating the Scale | 第106-108页 |
| ·The Percentile Rank Score Scale | 第108-112页 |
| ·Definition and Illustration | 第109-110页 |
| ·Application(s) | 第110-111页 |
| ·Evaluating the Scale | 第111-112页 |
| ·The Standard Score Scale | 第112-124页 |
| ·Definition and Illustration | 第113-122页 |
| ·Application(s) | 第122-123页 |
| ·Evaluating the Scale | 第123-124页 |
| ·The Grade Equivalent Score Scale | 第124-128页 |
| ·Definition and Illustration | 第125-126页 |
| ·Application(s) | 第126-127页 |
| ·Evaluating the Scale | 第127-128页 |
| ·The Latent Trait Score Scale | 第128-135页 |
| ·Definition and Illustration | 第128-131页 |
| ·Application(s) | 第131-132页 |
| ·Evaluating the Scale | 第132-135页 |
| ·The Standardized Item-Based Score Scale | 第135-147页 |
| ·Definition and Illustration | 第136-141页 |
| ·Application(s) | 第141-143页 |
| ·Evaluating the Models | 第143-147页 |
| ·Three Models for Scoring | 第147-156页 |
| ·Limitations of Conventional Scoring Models | 第148-149页 |
| ·Fundamental Considerations of Scoring Models | 第149-152页 |
| ·Three Scoring Models | 第152-156页 |
| ·The Power Scoring Models | 第152-154页 |
| ·The Logistic Scoring Model | 第154-155页 |
| ·Standard Uncertainty of the Generated Scores | 第155页 |
| ·Some General Suggestions | 第155-156页 |
| ·Summary | 第156-157页 |
| Chapter 4 The Reporting of Language Test Scores | 第157-191页 |
| ·Some General Considerations of Score Reporting | 第158-176页 |
| ·The Purposes of Testing | 第159-161页 |
| ·The Primary Purposes of Testing | 第159-160页 |
| ·The Secondary Purposes of Testing | 第160-161页 |
| ·The Anticipated Users of Test Results | 第161-165页 |
| ·The Non-qualified Users | 第162-163页 |
| ·The Less-qualified Users | 第163-164页 |
| ·The Well-qualified Users | 第164-165页 |
| ·Information on the Score Report and Information Reserved for the Supporting Documents. | 第165-176页 |
| ·Information on the Score Report | 第166-170页 |
| ·What to Be Provided in the Supporting Documents | 第170-176页 |
| ·Some Technical Considerations of Score Reporting | 第176-190页 |
| ·True Score, Its Estimate and the Uncertainty of the Estimate | 第176-185页 |
| ·The True Score | 第176-177页 |
| ·The Estimates of True Scores | 第177-178页 |
| ·The Uncertainty of an Estimate: Its Evaluation and Expression | 第178-184页 |
| ·The Correction for Guessing | 第184-185页 |
| ·The Reliability of Test Scores | 第185-190页 |
| ·The Stability of Scores | 第185-186页 |
| ·The Parallel Form Reliability | 第186-187页 |
| ·The Generalizability of Observed Scores over the Item Universe | 第187-189页 |
| ·The Generalizability of Observed Scores over the Rater Universe | 第189页 |
| ·The Generalizability of Observed Scores over Both the Item and the Rater Universe | 第189-190页 |
| ·Summary | 第190-191页 |
| Chapter 5 The Interpretation of Language Test Scores | 第191-228页 |
| ·Validity and Score Interpretation | 第192-201页 |
| ·The Evolving Concept of Validity | 第192-199页 |
| ·Validity as Test-Criterion Correlation | 第193-194页 |
| ·Validity as Consisting of Different Types | 第194-197页 |
| ·Validity as a Unitary Concept | 第197-199页 |
| ·Validity as the Appropriateness of Score Interpretation | 第199-201页 |
| ·Norms and Norm-Referenced Score Interpretation | 第201-218页 |
| ·Norms and Norming | 第202-210页 |
| ·Norms, Norm Groups and the Criteria for Norms | 第202-203页 |
| ·Classification of Norms | 第203-210页 |
| ·Interpreting Test Scores by Referencing to the Norms | 第210-218页 |
| ·Interpreting Test Scores by Referencing to the Percentile Rank Norms | 第210-215页 |
| ·Interpreting Test Scores by Referencing to the Group Average Norm. | 第215-217页 |
| ·Summary of the Section | 第217-218页 |
| ·Criterion and Criterion-Referenced Score Interpretation | 第218-226页 |
| ·The Criterion | 第218-221页 |
| ·Criterion as Mastery of Domain Knowledge | 第219页 |
| ·Criterion as Performance on Target Tasks | 第219-220页 |
| ·Criterion as Proficiency in Relation to Future Needs | 第220-221页 |
| ·Criterion-Referenced Score Interpretation | 第221-226页 |
| ·Interpreting the Criterion Score by Referencing to the Cut Score(s) | 第222-223页 |
| ·Interpreting the Criterion Score by Referencing to the Expectancy Table | 第223-224页 |
| ·Interpreting the Criterion Score by Referencing to Proficiency Descriptors | 第224-225页 |
| ·Interpreting the Criterion Score by Referencing to the Scoring Standards | 第225-226页 |
| ·Summary | 第226-228页 |
| Chapter 6 Analyzing the TEM | 第228-262页 |
| ·Background Information | 第228-242页 |
| ·General Background Information | 第228-229页 |
| ·A Brief History of TEM | 第229-230页 |
| ·A Brief History of TEM 4 | 第229页 |
| ·A Brief History of TEM 8 | 第229-230页 |
| ·The Growing Population of TEM | 第230-234页 |
| ·The Growing Population of TEM 4 | 第230-232页 |
| ·The Growing Population of TEM 8 | 第232-234页 |
| ·The Changing Formats of TEM | 第234-242页 |
| ·The Changing Formats of TEM 4 | 第234-239页 |
| ·The Changing Formats of TEM 8 | 第239-242页 |
| ·Analyzing the Structure of the TEM Test | 第242-255页 |
| ·The Semantic Structure of the E-TEM 4 | 第242-249页 |
| ·The Surface Structure | 第242-244页 |
| ·The Deep Structure | 第244-249页 |
| ·The Structure of the New Generation TEM 4 | 第249-251页 |
| ·The Structure of TEM 8 | 第251-255页 |
| ·The Surface Structure of TEM 8 | 第251-253页 |
| ·The Deep Structure of TEM 8 | 第253-255页 |
| ·The TEM Scoring Practice and the TEM Certificates | 第255-261页 |
| ·Marking the TEM Tests | 第255-257页 |
| ·Machine-marking the Multiple Choice Questions | 第255页 |
| ·Hand-marking the Constructed Response Questions | 第255-257页 |
| ·Reporting the TEM Result | 第257-259页 |
| ·Reporting the TEM Score at the Individual Level | 第258页 |
| ·Reporting the TEM Score at the Institutional Level | 第258-259页 |
| ·Granting the TEM Certificates | 第259-261页 |
| ·Summary | 第261-262页 |
| Chapter 7 Some Recommendations for the TEM | 第262-282页 |
| ·General Recommendations for TEM | 第263-268页 |
| ·Purposes and Intended Uses of TEM | 第263-264页 |
| ·D imensionality and Testing Methods of TEM | 第264-265页 |
| ·Raw Score or Scale Score? Skill Scores or Total Score? | 第265页 |
| ·The TEM Certificates | 第265-267页 |
| ·Training Score Interpreters | 第267页 |
| ·Building an Official Website for the TEM | 第267-268页 |
| ·Technical Recommendations for TEM | 第268-281页 |
| ·Scoring TEM | 第268-273页 |
| ·The Dimensionality of TEM | 第268-269页 |
| ·Score Scales | 第269-273页 |
| ·Reporting TEM Result | 第273-277页 |
| ·The TEM Score Report | 第273-275页 |
| ·The Uncertainty and Normative Information of TEM | 第275-277页 |
| ·Interpreting TEM Scores | 第277-281页 |
| ·Descriptors | 第278-280页 |
| ·Users’Guide | 第280-281页 |
| ·Summary | 第281-282页 |
| Chapter 8 Concluding Remarks | 第282-289页 |
| ·Major Contributions | 第282-285页 |
| ·Theoretical Contributions | 第282-284页 |
| ·Practical Contributions | 第284-285页 |
| ·Limitations and Suggestions for Further Research | 第285-287页 |
| ·Summary | 第287-289页 |
| Bibliography | 第289-300页 |