[2024-03-05 23:36:41,948] [INFO] [real_accelerator.py:191:get_accelerator] Setting ds_accelerator to cuda (auto detect)
dP dP dP
88 88 88
.d8888b. dP. .dP .d8888b. 88 .d8888b. d8888P 88
88' `88 `8bd8' 88' `88 88 88' `88 88 88
88. .88 .d88b. 88. .88 88 88. .88 88 88
`88888P8 dP' `dP `88888P' dP `88888P' dP dP
[2024-03-05 23:36:43,224] [INFO] [axolotl.normalize_config:178] [PID:607731] [RANK:0] GPU memory usage baseline: 0.000GB (+0.498GB misc)
[2024-03-05 23:36:43,333] [WARNING] [axolotl.cli.preprocess.do_cli:54] [PID:607731] [RANK:0] preprocess CLI called without dataset_prepared_path set, using default path: last_run_prepared
[2024-03-05 23:36:43,725] [DEBUG] [axolotl.load_tokenizer:245] [PID:607731] [RANK:0] EOS: 2 / </s>
[2024-03-05 23:36:43,725] [DEBUG] [axolotl.load_tokenizer:246] [PID:607731] [RANK:0] BOS: 1 / <s>
[2024-03-05 23:36:43,725] [DEBUG] [axolotl.load_tokenizer:247] [PID:607731] [RANK:0] PAD: 2 / </s>
[2024-03-05 23:36:43,725] [DEBUG] [axolotl.load_tokenizer:248] [PID:607731] [RANK:0] UNK: 0 / <unk>
[2024-03-05 23:36:43,725] [INFO] [axolotl.load_tokenizer:259] [PID:607731] [RANK:0] No Chat template selected. Consider adding a chat template for easier inference.
[2024-03-05 23:36:43,725] [INFO] [axolotl.load_tokenized_prepared_datasets:191] [PID:607731] [RANK:0] Unable to find prepared dataset in last_run_prepared/6c807fe9ce94fe4afac210baedaf3996
[2024-03-05 23:36:43,725] [INFO] [axolotl.load_tokenized_prepared_datasets:192] [PID:607731] [RANK:0] Loading raw datasets...
Downloading data files: 100%|██████████████████| 1/1 [00:00<00:00, 14169.95it/s]
Extracting data files: 100%|████████████████████| 1/1 [00:00<00:00, 2332.76it/s]
Generating train split: 500 examples [00:00, 128762.33 examples/s]
Tokenizing Prompts (num_proc=64): 100%|█| 500/500 [00:01<00:00, 386.09 examples/
[2024-03-05 23:36:46,074] [INFO] [axolotl.load_tokenized_prepared_datasets:414] [PID:607731] [RANK:0] merging datasets
[2024-03-05 23:36:46,079] [DEBUG] [axolotl.log:61] [PID:607731] [RANK:0] min_input_len: 14
[2024-03-05 23:36:46,080] [DEBUG] [axolotl.log:61] [PID:607731] [RANK:0] max_input_len: 14
Dropping Long Sequences (num_proc=72): 100%|█| 500/500 [00:00<00:00, 1895.44 exa
[2024-03-05 23:36:46,943] [INFO] [axolotl.load_tokenized_prepared_datasets:424] [PID:607731] [RANK:0] Saving merged prepared dataset to disk... last_run_prepared/6c807fe9ce94fe4afac210baedaf3996
Saving the dataset (1/1 shards): 100%|█| 500/500 [00:00<00:00, 50445.05 examples
[2024-03-05 23:36:46,965] [DEBUG] [axolotl.log:61] [PID:607731] [RANK:0] total_num_tokens: 7000
[2024-03-05 23:36:46,966] [DEBUG] [axolotl.log:61] [PID:607731] [RANK:0] `total_supervised_tokens: 5500`
[2024-03-05 23:36:46,966] [DEBUG] [axolotl.log:61] [PID:607731] [RANK:0] total_num_steps: 75
[2024-03-05 23:36:46,967] [INFO] [axolotl.scripts.load_datasets:387] [PID:607731] [RANK:0] check_dataset_labels...
[2024-03-05 23:36:46,969] [INFO] [axolotl.check_example_labels:35] [PID:607731] [RANK:0] <s>(1, 1) Hello(22557, 22557)
(13, 13) hi(12014, 12014) there(736, 736) !(28808, 28808) .(28723, 28723) (28705, 28705) good(-100, 1179) bye(-100, 17664) (-100, 28705) fare(19111, 19111) well(5458, 5458) </s>(2, 2)
[2024-03-05 23:36:46,969] [INFO] [axolotl.check_example_labels:36] [PID:607731] [RANK:0]
[2024-03-05 23:36:46,969] [INFO] [axolotl.scripts.load_datasets:400] [PID:607731] [RANK:0] printing prompters...
[2024-03-05 23:36:46,969] [INFO] [axolotl.scripts.load_datasets:402] [PID:607731] [RANK:0] Pre-tokenized or custom dataset types are unsupported for logging
[2024-03-05 23:36:46,973] [INFO] [axolotl.cli.preprocess.do_cli:62] [PID:607731] [RANK:0] Success! Preprocessed data path: `dataset_prepared_path: last_run_prepared`