This function allows a user to read data from a hadoop sequence
file. A sequence file consists of (key value) pairs sequentially. At
the moment, org.apache.hadoop.io.Text
is the only serialization type
being supported, and there is no compression support.
sequence_file_dataset(filenames)
filenames | A |
---|
# NOT RUN { dataset <- sequence_file_dataset("testdata/string.seq") %>% dataset_repeat(1) sess <- tf$Session() iterator <- make_iterator_one_shot(dataset) next_batch <- iterator_get_next(iterator) until_out_of_range({ batch <- sess$run(next_batch) print(batch) }) # }