mario_lm = MarioLM(lm=BASE, tokenizer=BASE) The parameter here prompts an error. #20

54457616 · 2023-06-30T15:24:04Z

TrainingConfig and MarioGPTTrainer cannot be used.
mario_lm = MarioLM(lm=BASE, tokenizer=BASE) The parameter here prompts an error. Is there no specific program for this training?

Can you give some specific suggestions？

shyamsn97 · 2023-06-30T15:57:05Z

Hey! What version of mario-gpt are you running? Can you try a pip install mario-gpt —upgrade?

54457616 · 2023-06-30T16:04:27Z

The PYTHON version used by the system is 3.10, and the corresponding program version is the latest download from the website. The first step ran smoothly without any problems, as shown in the figure, and the above error will appear in the second step.

shyamsn97 · 2023-06-30T17:16:54Z

Can I see the full stacktrace? Because from what I see above it looks like the error is coming from:

mario_lm = MarioLM(lm_path=BASE, tokenizer_path=BASE)

But below it looks like its working? Doesn't really look like an issue with the trainer / training config.

I ran the code in a new clean workspace:

>>> import torch
>>> from mario_gpt import MarioDataset, MarioLM, TrainingConfig, MarioGPTTrainer
>>> BASE = "distilgpt2"
>>> mario_lm = MarioLM(lm_path=BASE, tokenizer_path=BASE)
Using distilgpt2 lm
/home/shyam/miniconda3/envs/py39/lib/python3.9/site-packages/transformers/models/auto/modeling_auto.py:1352: FutureWarning: The class `AutoModelWithLMHead` is deprecated and will be removed in a future version. Please use `AutoModelForCausalLM` for causal language models, `AutoModelForMaskedLM` for masked language models and `AutoModelForSeq2SeqLM` for encoder-decoder models.
  warnings.warn(
Some weights of GPT2LMHeadModel were not initialized from the model checkpoint at distilgpt2 and are newly initialized: ['transformer.h.0.crossattention.c_attn.weight', 'transformer.h.3.crossattention.c_attn.weight', 'transformer.h.4.crossattention.bias', 'transformer.h.5.crossattention.bias', 'transformer.h.2.crossattention.q_attn.weight', 'transformer.h.3.ln_cross_attn.weight', 'transformer.h.2.crossattention.c_proj.weight', 'transformer.h.2.crossattention.c_proj.bias', 'transformer.h.2.ln_cross_attn.weight', 'transformer.h.5.crossattention.c_proj.bias', 'transformer.h.3.crossattention.c_proj.bias', 'transformer.h.0.crossattention.c_proj.bias', 'transformer.h.5.crossattention.c_proj.weight', 'transformer.h.5.ln_cross_attn.weight', 'transformer.h.3.crossattention.masked_bias', 'transformer.h.1.crossattention.c_proj.weight', 'transformer.h.5.crossattention.c_attn.weight', 'transformer.h.1.crossattention.masked_bias', 'transformer.h.1.crossattention.c_proj.bias', 'transformer.h.3.crossattention.c_proj.weight', 'transformer.h.0.ln_cross_attn.weight', 'transformer.h.1.crossattention.bias', 'transformer.h.3.crossattention.bias', 'transformer.h.5.crossattention.masked_bias', 'transformer.h.5.crossattention.q_attn.weight', 'transformer.h.1.crossattention.q_attn.weight', 'transformer.h.1.crossattention.c_attn.weight', 'transformer.h.4.crossattention.q_attn.weight', 'transformer.h.0.crossattention.bias', 'transformer.h.3.crossattention.q_attn.weight', 'transformer.h.0.crossattention.masked_bias', 'transformer.h.4.crossattention.c_proj.bias', 'transformer.h.4.crossattention.c_attn.weight', 'transformer.h.2.crossattention.bias', 'transformer.h.0.crossattention.c_proj.weight', 'transformer.h.4.crossattention.c_proj.weight', 'transformer.h.2.crossattention.masked_bias', 'transformer.h.1.ln_cross_attn.weight', 'transformer.h.0.crossattention.q_attn.weight', 'transformer.h.4.ln_cross_attn.weight', 'transformer.h.4.crossattention.masked_bias', 'transformer.h.2.crossattention.c_attn.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
Using distilgpt2 tokenizer

Can you try doing a pip uninstall mario-gpt and then re run the python setup.py install step again?

54457616 · 2023-07-01T14:18:12Z

I will redeploy according to your suggestion, the main problem before is mario_lm = MarioLM(lm=BASE, tokenizer=BASE)
The second question is TrainingConfig, MarioGPTTrainer.

54457616 · 2023-07-01T15:09:47Z

The program has been uninstalled

The program is installed successfully

Sampling runs correctly

The front of Train is running normally

Error running after Train

shyamsn97 · 2023-07-04T17:45:21Z

Ah looks like accelerator changed their api. I’ll update it!

54457616 · 2023-07-14T09:15:02Z

Whether the relevant modification is completed, I look forward to your revision, I hope to continue to debug your results, thank you.

shyamsn97 · 2023-08-02T20:34:12Z

Should be fixed now! Let me know if you still have errors

54457616 · 2023-08-03T15:25:01Z

With your revision, my local operation can be successfully completed at present. Can you give a general description of the relevant files you have modified? And how can the content generated by training be better for fine-tuning the model? Before you provide the modification, after your reminder, I can run normally under the original code when I use the accelerate==0.16.0 previous version.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mario_lm = MarioLM(lm=BASE, tokenizer=BASE) The parameter here prompts an error. #20

mario_lm = MarioLM(lm=BASE, tokenizer=BASE) The parameter here prompts an error. #20

54457616 commented Jun 30, 2023

shyamsn97 commented Jun 30, 2023

54457616 commented Jun 30, 2023

shyamsn97 commented Jun 30, 2023

54457616 commented Jul 1, 2023

54457616 commented Jul 1, 2023

shyamsn97 commented Jul 4, 2023

54457616 commented Jul 14, 2023

shyamsn97 commented Aug 2, 2023

54457616 commented Aug 3, 2023

mario_lm = MarioLM(lm=BASE, tokenizer=BASE) The parameter here prompts an error. #20

mario_lm = MarioLM(lm=BASE, tokenizer=BASE) The parameter here prompts an error. #20

Comments

54457616 commented Jun 30, 2023

shyamsn97 commented Jun 30, 2023

54457616 commented Jun 30, 2023

shyamsn97 commented Jun 30, 2023

54457616 commented Jul 1, 2023

54457616 commented Jul 1, 2023

shyamsn97 commented Jul 4, 2023

54457616 commented Jul 14, 2023

shyamsn97 commented Aug 2, 2023

54457616 commented Aug 3, 2023