Platform for Drug Discovery


Dataset type: fastq (paired-end)


Description

    FASTQ-formatted DNA sequence file (paired-end)

File example

    [Images: two file examples]

Available file name suffixes when you upload files as this dataset type

    '.fastq', '.fq' or '.txt'

Examples of available file name pairs

  • Type 1
    #  file name template  example            file type
    1  (prefix)_1(suffix)  SRR039649_1.fastq  File for the first sequence in the pair.
    2  (prefix)_2(suffix)  SRR039649_2.fastq  File for the second sequence in the pair.
    3  (prefix)(suffix)    SRR039649.fastq    Extra sequence file containing orphan reads. This file is not required, and most tools ignore it.
  • Type 2
    #  file name template                            example             file type
    1  (prefix)_(1-digit number)_1_sequence(suffix)  s_3_1_sequence.txt  File for the first sequence in the pair.
    2  (prefix)_(1-digit number)_2_sequence(suffix)  s_3_2_sequence.txt  File for the first barcode sequence.
    3  (prefix)_(1-digit number)_3_sequence(suffix)  s_3_3_sequence.txt  File for the second sequence in the pair.
    4  (prefix)_(1-digit number)_4_sequence(suffix)  s_3_4_sequence.txt  File for the second barcode sequence. This file is not required.
  • Type 3
    #  file name template                            example             file type
    1  (prefix)_(1-digit number)_1_sequence(suffix)  s_3_1_sequence.txt  File for the first sequence in the pair.
    2  (prefix)_(1-digit number)_2_sequence(suffix)  s_3_2_sequence.txt  File for the second sequence in the pair.
  • Type 4
    #  file name template   example                       file type
    1  (prefix)_R1(suffix)  SAMPLE1_ATCACG_L002_R1.fastq  File for the first sequence in the pair.
    2  (prefix)_R2(suffix)  SAMPLE1_ATCACG_L002_R2.fastq  File for the second sequence in the pair.
  • Type 5
    #  file name template                    example                 file type
    1  (prefix)_R1_(3-digit number)(suffix)  s_G1_L001_R1_002.fastq  File for the first sequence in the pair.
    2  (prefix)_R2_(3-digit number)(suffix)  s_G1_L001_R2_002.fastq  File for the second sequence in the pair.
    • (prefix): Any string of letters, digits and the special characters ".", "_" or "-".
    • (suffix): ".fastq", ".fq" or ".txt".
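
As an illustration, the file name templates above can be checked with regular expressions. The following is a minimal Python sketch, not the platform's own implementation. The names SUFFIX, PREFIX, TEMPLATES and classify are hypothetical, and the sketch assumes the prefix is limited to letters, digits, ".", "_" and "-". Type 2 and Type 3 share the same templates, so a single file name cannot tell them apart and both are reported as "Type 2/3".

```python
import re

# Assumption: these patterns are derived from the templates listed above.
SUFFIX = r"(?:\.fastq|\.fq|\.txt)"
PREFIX = r"(?P<prefix>[A-Za-z0-9._-]+)"

# Most specific templates first, because the prefix may itself contain "_".
TEMPLATES = [
    ("Type 5", re.compile(PREFIX + r"_R(?P<index>[12])_\d{3}" + SUFFIX)),
    ("Type 4", re.compile(PREFIX + r"_R(?P<index>[12])" + SUFFIX)),
    ("Type 2/3", re.compile(PREFIX + r"_\d_(?P<index>[1-4])_sequence" + SUFFIX)),
    ("Type 1", re.compile(PREFIX + r"_(?P<index>[12])" + SUFFIX)),
    ("Type 1 (extra file)", re.compile(PREFIX + SUFFIX)),
]


def classify(file_name):
    """Return (template label, prefix, file index) or None if nothing matches."""
    for label, pattern in TEMPLATES:
        match = pattern.fullmatch(file_name)
        if match:
            # The extra-file template has no index group, so .get() yields None.
            return label, match.group("prefix"), match.groupdict().get("index")
    return None


if __name__ == "__main__":
    for name in ("SRR039649_1.fastq", "s_3_2_sequence.txt",
                 "SAMPLE1_ATCACG_L002_R1.fastq", "s_G1_L001_R2_002.fastq"):
        print(name, "->", classify(name))
```

Two files can then be treated as a candidate pair when they share the same label and prefix and differ only in the file index.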

Available sequence IDs for pairs

  • Type 1
    • Identical in the pair
    • File 1 ID example                   Correlated file 2 ID example
      ERR003169.1 IL37_2099:6:1:1:405     ERR003169.1 IL37_2099:6:1:1:405
      ERR003169.2 IL37_2099:6:1:1:1526    ERR003169.2 IL37_2099:6:1:1:1526
      ERR003169.3 IL37_2099:6:1:1:1553    ERR003169.3 IL37_2099:6:1:1:1553
      ERR003169.4 IL37_2099:6:1:1:1187    ERR003169.4 IL37_2099:6:1:1:1187
      ERR003169.5 IL37_2099:6:1:1:1382    ERR003169.5 IL37_2099:6:1:1:1382
    • All Read Archive data use this type of sequence ID pair.
  • Type 2
    • The ID ends with the postfix "/1" in the first read and "/2" in the second read
    • File 1 ID example                      Correlated file 2 ID example
      ABC-DEF123_0011_FC:5:1:2409:953#0/1    ABC-DEF123_0011_FC:5:1:2409:953#0/2
      ABC-DEF123_0011_FC:5:1:3289:953#0/1    ABC-DEF123_0011_FC:5:1:3289:953#0/2
      ABC-DEF123_0011_FC:5:1:4441:953#0/1    ABC-DEF123_0011_FC:5:1:4441:953#0/2
      ABC-DEF123_0011_FC:5:1:4664:951#0/1    ABC-DEF123_0011_FC:5:1:4664:951#0/2
      ABC-DEF123_0011_FC:5:1:4773:951#0/1    ABC-DEF123_0011_FC:5:1:4773:951#0/2
    • Illumina raw data use this type of sequence ID pair.
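
The two sequence ID conventions above can be verified before upload. The following Python sketch shows one way to do so. It is an illustration under stated assumptions, not the platform's own check: the function names ids_are_paired and first_unpaired are hypothetical, IDs are compared in file order, and a leading "@" (the FASTQ header marker) is ignored if present.

```python
from itertools import zip_longest


def _clean(seq_id):
    """Strip surrounding whitespace and an optional leading '@'."""
    seq_id = seq_id.strip()
    return seq_id[1:] if seq_id.startswith("@") else seq_id


def ids_are_paired(first_id, second_id):
    """True if the two IDs follow the Type 1 or Type 2 convention."""
    a, b = _clean(first_id), _clean(second_id)
    if a.endswith("/1") or b.endswith("/2"):
        # Type 2: same ID apart from the "/1" and "/2" postfixes.
        return a.endswith("/1") and b.endswith("/2") and a[:-2] == b[:-2]
    # Type 1: IDs are identical in both files.
    return a == b


def first_unpaired(first_ids, second_ids):
    """Return the 0-based position of the first unpaired record, or None."""
    for position, (a, b) in enumerate(zip_longest(first_ids, second_ids)):
        # A missing ID (None) means the two files differ in length.
        if a is None or b is None or not ids_are_paired(a, b):
            return position
    return None
```

For example, first_unpaired applied to the ID lines of two files (every fourth line of a FASTQ file, starting with the first) returns None when all records pair up.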