8a5f80bb61
Currently `BLOOMSharded` is a subclass of `CausalLM`, but it skips `CausalLM`'s constructor. This is surprising behavior that we might want to avoid. This PR extracts `CausalLM`'s constructor logic into `AutoCausalLM`, which detects settings from `model_id`, so that we don't have to skip `CausalLM`'s constructor.

| File |
|---|
| test_bloom.py |
| test_causal_lm.py |
| test_model.py |
| test_santacoder.py |
| test_seq2seq_lm.py |
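
The refactor described in the commit message can be illustrated with a minimal sketch. The real signatures are not shown here, so everything except the class names `CausalLM`, `BLOOMSharded`, and `AutoCausalLM` is an assumption: `ModelSettings`, `detect_settings`, `from_model_id`, and the `sharded` flag are hypothetical names chosen for illustration, and the model-id check is a placeholder rather than the project's actual dispatch rule.

```python
from dataclasses import dataclass


@dataclass
class ModelSettings:
    """Hypothetical bundle of settings inferred from a model id."""

    model_id: str
    sharded: bool


class CausalLM:
    """Simplified stand-in: the constructor only stores what it is given."""

    def __init__(self, settings: ModelSettings) -> None:
        self.settings = settings


class BLOOMSharded(CausalLM):
    """Subclass that calls the parent constructor instead of skipping it."""

    def __init__(self, settings: ModelSettings) -> None:
        super().__init__(settings)


class AutoCausalLM:
    """Hypothetical factory that owns the settings-detection logic."""

    @staticmethod
    def detect_settings(model_id: str, sharded: bool = False) -> ModelSettings:
        # Placeholder detection: a real implementation would inspect the
        # model configuration associated with `model_id`.
        return ModelSettings(model_id=model_id, sharded=sharded)

    @classmethod
    def from_model_id(cls, model_id: str, sharded: bool = False) -> CausalLM:
        settings = cls.detect_settings(model_id, sharded)
        # Illustrative dispatch rule only (assumption, not the project's rule).
        if sharded and model_id.startswith("bigscience/bloom"):
            return BLOOMSharded(settings)
        return CausalLM(settings)


if __name__ == "__main__":
    model = AutoCausalLM.from_model_id("bigscience/bloom", sharded=True)
    print(type(model).__name__, model.settings)
```

Because detection lives in the factory, `CausalLM.__init__` stays trivial and every subclass can call `super().__init__` without inheriting setup work it does not need.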