Datafiles come in many different flavors/shapes and your success in obtaining meaningful results depends on your ability to process them into a form that is understood by your stats program of choice. Although the requirements for importing data will vary greatly among programs, the most useful (i.e., commonly recognized) structure of a datafile will have column entries within a line separated by a "tab" (i.e., ASCII name: - "ht"; C Escape Sequence: "\t"; Hexadecimal character code $0x09). A line is separated from the next row of data points with an "end of line" character. These unfortunately vary depending on the computer platform. Textfiles formatted for MacOS indicate an end of line with a "carriage return" character (i.e., ASCII name: - "cr"; C Escape Sequence: "\r"; Hexadecimal character code $0x0D); Files formated for UnixOS code the start of a new line with a "line feed" character (i.e., ASCII name: - "lf" or "nl"; Hexadecimal character code $0x0A); Dos/Windows files use both characters ("cr", "lf") for that purpose. A good text editor will allow you to convert between these. Take a good look at the file first: How many rows are there? Are there multiple entries per line? How are they separated? What line endings are there? Are there missing data points? Does the first line contain data or variable names? Based on these considerations design a strategy to process the file - if needed. The following exercises are meant to familiarize you with some common things.
Competences earned this week:
Special Reward: