Visit the SPSS home page.
The program will open an empty datafile window. There are many other windows in SPSS, many of which will be described in this brief tutorial. The datafile window is where your data is stored.
To open data file dialog window:
click on "File,"
click on "Open" (This provides opportunity to acces previously created
data files)
double click on desired file
To save a file you use the"Save" file and "Save As" commands
A few things to note:
Columns correspond to variables
Rows correspond to cases (i.e. indivduals)
SPSS will try to "format" the column based on the first thing that you type in. So


Once you have opened your file, click on the "Variable View" tab at the
bottom of the window. This will take you to a similar looking
spreadsheet. The difference is that each row in this spreadsheet contains
the characteristics of each column in the other (data view) spreadsheet.

So the first row corresponds to the variable 'IDnumber', the second row to 'Name' (depending on how you set it up, it could also be last name or first name), the next to 'age', etc.
The columns on this spreadsheet are the characteristics of the variables. These include: name, type, width, decimals, labels, values, missing, columns, align, & measure.
Type - This refers to the kind of data that is being inputed into
the cells. So for example, income can be made a 'dollar' type.

Width - this is the number of characters that the data cell will allow within it. For example, for the 'names' variable, you'll want to set it pretty high, to fit the entire names in here. (note: this can also be set in the "type popup window" see above picture)
Decimals - this is the number of places after the decimal place. So for example, ID number doesn't need any numbers after the decimal place, so you can set this for zero. (note: this can also be set in the "type popup window" see above picture)
Labels - This allows you to give your variables a longer, more detailed name (not limited to 8 characters). For example, for income, you could enter a label like "income earned in 1999."
Values - these are VERY IMPORTANT for CATEGORICAL variables. When entering gender into the data file, you probably typed in Male and Female. While this looks fine, it isn't the way that gender should be inputed if you want to ever use the variable in an analysis. Gender is a categorical variable (nominal). SPSS requires that categorical variables be entered in as numbers.
![]() |
So Male should be 1 and Female 2 (or visa versa, just be
consistent). Then you can use the values to specify that 1 = male and 2 =
female. Enter the number of the category in the 'value' field and the name of the category in the 'value label' field. Then click the 'add' button.
You can get the data view spreadsheet to list categorical variables by
their numbers or their labels by clicking the label button in the menu.
|
Missing - it isn't uncommon that for some individuals in your dataset, there may be some missing information. There are many different ways to deal with this problem. This is one way to let SPSS know how to treat missing data for each variable. For now, we will ignore this characteristic.
Columns - this specifies how wide you want the variable column to look in the data window.
Align - this specifies how the data should be aligned in a column. Your choices are centered, left, and right.
Measure - In this column, you should specify what scale of measurement the variable has been measured in. Note that interval and ratio have been combined into one "scale" type.

Step 2: Then select "quiz1" for the sort variable field.

Computing New Variables |
Recoding |
This allows you to change the way in which you have coded a variable. For example you may want to change all of your 1's and 2's to 0's and 1's. SPSS gives you two options:
Choose Transform, Recode from menu bar
SPSS will create frequency distribution tables for you. Go to the "Analyze" menu, select "Descriptive statistics", and within that sub menu select "Frequencies".

SPSS will then ask you for which variable you want the table for.

For quiz 1 the frequency table output should look something like this:


Bar charts are also useful for presenting other summary statistics, like means, for different groups. For example suppose that we wanted to know the mean on quiz 1 by the three different sections.
In this case the histogram is a little different than you might expect after comparing it to the frequency distribution table above. Why?
In a scatterplot, you would simply put your two variables into the Y and X axes (for example, maybe you have test scores on two quizzes). It generally doesn't matter which variable is specified to be on the x axis and which variable is specified on the y axis. The one shown below gets even a little more complicated in that different points are represented by different colors to indicate which section that person belongs to (of 3 course sections). You should notice that each point represents a case or person and the point shows the intersection of their two quiz scores.
Your scatterplot should look like this.