File Formats

PED

Describes individuals and their genetic data.

Columns:

###Header line

* names the 8 fixed, mandatory columns of a VCF file
* tab-delimited

The columns are:

* #CHROM: identifier of the chromosome or contig in the reference genome
* POS: position in the reference (1-based)
* ID: unique identifier, such as a dbSNP rs number
* REF: reference allele (A, C, G, T or N); for indels, includes the base before the event
* ALT: non-reference alleles called on at least one of the samples; A, C, G, T, N or
* QUAL: Phred-scaled quality score; high scores indicate high confidence
* FILTER: PASS, or codes for the filters that the call failed
* INFO: additional information
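As a rough illustration of the eight fixed columns listed above, the following Python sketch splits one tab-delimited data line into named fields. It assumes the format is VCF (which these column names correspond to). The names `VcfRecord` and `parse_vcf_line` and the sample line are hypothetical and chosen only for this example. It is a minimal sketch, not a full parser: it ignores meta-information lines and any per-sample columns.

```python
from typing import List, NamedTuple


class VcfRecord(NamedTuple):
    """Hypothetical container for the 8 fixed columns of one data line."""
    chrom: str
    pos: int
    variant_id: str
    ref: str
    alt: List[str]   # ALT may hold several comma-separated alleles
    qual: str        # kept as text because a missing value is written as "."
    filters: str
    info: str


def parse_vcf_line(line: str) -> VcfRecord:
    """Split one tab-delimited data line into the 8 fixed fields."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 8:
        raise ValueError(f"expected at least 8 columns, got {len(fields)}")
    chrom, pos, variant_id, ref, alt, qual, filters, info = fields[:8]
    return VcfRecord(chrom, int(pos), variant_id, ref,
                     alt.split(","), qual, filters, info)


# Made-up example line, for illustration only.
record = parse_vcf_line("20\t14370\trs6054257\tG\tA,T\t29\tPASS\tNS=3;DP=14\n")
print(record.chrom, record.pos, record.alt)
```

Keeping QUAL as a string is a deliberate simplification here; a real reader would probably convert it to a number when it is not missing.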

###BAM

* binary version of a SAM file
* sequence alignment data

###SAM

* Header lines start with ‘@’
* Alignment lines have 11 mandatory fields for essential alignment info
* More Info
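The two kinds of SAM lines described above can be told apart with a short Python sketch. The field names in `SAM_FIELDS` are taken from the SAM specification rather than from this document, and the function name `split_sam` and the sample lines are hypothetical. The sketch assumes tab-delimited text input and does not validate field contents.

```python
# Names of the 11 mandatory alignment fields, per the SAM specification.
SAM_FIELDS = ("QNAME", "FLAG", "RNAME", "POS", "MAPQ", "CIGAR",
              "RNEXT", "PNEXT", "TLEN", "SEQ", "QUAL")


def split_sam(lines):
    """Separate header lines ('@' prefix) from alignment lines."""
    headers, alignments = [], []
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        if line.startswith("@"):
            headers.append(line)
            continue
        cols = line.split("\t")
        if len(cols) < len(SAM_FIELDS):
            raise ValueError(f"expected at least 11 fields, got {len(cols)}")
        record = dict(zip(SAM_FIELDS, cols[:11]))
        record["OPTIONAL"] = cols[11:]  # any optional tags after the 11 fields
        alignments.append(record)
    return headers, alignments


# Made-up sample lines, for illustration only.
sample = [
    "@HD\tVN:1.6\tSO:coordinate\n",
    "r001\t99\tref\t7\t30\t8M\t=\t37\t39\tTTAGATAA\t*\tNM:i:0\n",
]
headers, alignments = split_sam(sample)
print(len(headers), alignments[0]["QNAME"], alignments[0]["CIGAR"])
```

All values are left as strings; converting FLAG, POS, MAPQ and the other numeric fields is left out to keep the sketch short.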

###CRAM

* like BAM
* compressed version of the alignment
* More Info

| SNP/rs ID | Chromosome | Position | Genotype |
|---|---|---|---|
| rs4477212 | 1 | 82154 | AA |
| rs3094315 | 1 | 752566 | AG |
| rs3131972 | 1 | 752721 | AG |
| rs12124819 | 1 | 776546 | AA |
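To make the four-column genotype layout above concrete, here is a small Python sketch that reads such rows into dictionaries. It assumes one whitespace-delimited record per line and that lines starting with `#` are comments; both are assumptions, since this document does not state the delimiter. The function name `parse_genotype_rows` is hypothetical.

```python
def parse_genotype_rows(text):
    """Parse rows of: rs ID, chromosome, position, genotype."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and assumed comment lines
        rsid, chrom, pos, genotype = line.split()
        rows.append({
            "rsid": rsid,
            "chromosome": chrom,   # kept as text; may be non-numeric (e.g. X)
            "position": int(pos),
            "genotype": genotype,
        })
    return rows


# The sample rows from this section, one record per line.
sample = """\
rs4477212 1 82154 AA
rs3094315 1 752566 AG
rs3131972 1 752721 AG
rs12124819 1 776546 AA
"""
for row in parse_genotype_rows(sample):
    print(row["rsid"], row["chromosome"], row["position"], row["genotype"])
```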