rstringdna-sequencesystemtime

Reading a file from a position in R


I have a large plain text file to be read in R, where all data is contained at the same line with no spaces (DNA sequence with no header). I found the next function:

readChar("filename",nchar=n)

which allows to read just the "n" first elements of the file saving a lot of time. Is there another function in R that goes further by reading just from START position to STOP one, avoiding to upload the whole file?


Solution

  • Basically no, from what i know, you need to read the whole file and then discard the characters that you don't want. For example, if you want only the first 10 letters for every line:

    strsub(readChar("filename",nchar=n),1,10)
    

    But, this post (How to efficiently read the first character from each line of a text file?) shows some ways of improving the efficiency of that.