ASSESSMENT ESSENTIALS
PRESIDENT'S MESSAGE  
WHAT IS ASSESSMENT?  
ASSESSMENT TERMS »

PRINCIPLES OF GOOD PRACTICE

 
ASSESSMENT FEEDBACK LOOPS  
EIGHT LEARNING OUTCOMES  
ASSESSMENT FAQs  
UNIVERSITY ASSESSMENT COMMITTEE  
ASSESSMENT PLAN  
HISTORY OF ASSESSMENT AT NWHSU  
ASSESSMENT TOOLS  
ASSESSMENT TERMS

Assessment: an ongoing process aimed at understanding and improving student learning. It involves: making our expectations explicit and public; setting appropriate criteria and high standards for learning quality; systematically gathering, analyzing, and interpreting evidence to determine how well performance matches those expectations and standards; and using the resulting information to document, explain, and improve performance.

Backload (--ed, --ing): amount of effort required after the data collection.
Competency: level at which performance is acceptable.
Convergent validity: general agreement among ratings, gathered independently of one another, where measures should be theoretically related.

Criterion-referenced: criterion-referenced tests determine what test takers can do and what they know, not how they compare to others. Criterion-referenced tests report how well students are doing relative to a pre-determined performance level on a specified set of educational goals or outcomes included in the curriculum.

Externality: the extent to which the results of the assessment can be generalized to a similar context.

External validity: the extent to which the results of a study are generalizable or transferable to other settings. Generalizability is the extent to which assessment findings and conclusions from a study conducted on a sample population can be applied to the population at large. Transferability is the ability to apply the findings in one context to another similar context.

Forced-choice: the respondent only has a choice among given responses (e.g., very poor, poor, fair, good, very good).

Formative assessment: intended to assess ongoing program/project activity and provide information to improve the project. Assessment feedback is short term in duration.

Frontload (--ed, --ing): amount of effort required in the early stage of assessment method development or data collection.

Generalization (generalizability): the extent to which assessment findings and conclusions from a study conducted on a sample population can be applied to the population at large.

Goal-free evaluation: focuses on actual outcomes rather than intended program outcomes. Evaluation is done without prior knowledge of the goals of the program.

Inter-rater reliability: the degree to which different raters/observers give consistent estimates of the same phenomenon.

Internal validity: (1) the rigor with which the study was conducted (e.g., the study's design, the care taken to conduct measurements, and decisions concerning what was and wasn't measured) and, (2) the extent to which the designers of a study have taken into account alternative explanations for any causal relationships they explore.

Longitudinal studies: data collected from the same population at different points in time.

Norm (--ative): a set standard of development or achievement usually derived from the average or median achievement of a large group.

Norm-reference: a norm-referenced test is designed to highlight achievement differences between and among students to produce a dependable rank order of students across a continuum of achievement from high achievers to low achievers.

Observer effect: the degree to which the assessment results are affected by the presence of an observer.

Open-ended: assessment questions that are designed to permit spontaneous and unguided responses.

Operational (--ize): defining a term or object so that it can be measured. Generally states the operations or procedures used that distinguish it from others.

Reliability: the extent to which an experiment, test, or any measuring procedure yields the same result on repeated trials

Rubrics: a set of categories that define and describe the important components of the work being completed, critiqued, or assessed. Each category contains a gradation of levels of completion or competence with a score assigned to each level and a clear description of what criteria need to be met to attain the score at each level.

Salience: a striking point or feature.

Stakeholder: anyone who has a vested interest in the outcome of the program/project.

Summative assessment: assessment that is done at the conclusion of a course or some larger instructional period (e.g., at the end of the program). The purpose is to determine success or to what extent the program/project/course met its goals.

Third party: person(s) other than those directly involved in the educational process (e.g., employers, parents, consultants)

Triangulate (triangulation): the use of a combination of assessment methods in a study. An example of triangulation would be an assessment that incorporated surveys, interviews, and observations.

Topology: mapping of the relationships among subjects.

Utility: usefulness of assessment results.

Variable (variability): observable characteristics that vary among individuals responses.

Validity: the degree to which a study accurately reflects or assesses the specific concept that the researcher is attempting to measure. Validity has three components:

relevance - the option measures your educational objective as directly as possible

accuracy - the option measures your educational objective as precisely as possible

utility - the option provides formative and summative results with clear implications for educational program evaluation and improvement


Copyright © Northwestern Health Sciences University.