When you run it, it will create the following directory tree inside split:
formula-nametrainformulaname
testformulaname
formula-inorganic-organictraininorganicorganic
test- ...
name-inorganic-organictrain- ...
test- ...
Then, it will split the data read from source into single-line files and place them inside the final directories.
