
Conversation

@Cyrilvallez
Member

What does this PR do?

Adds the GLM model.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ArthurZucker (Collaborator) left a comment


Super nice! You are missing the test files, integration tests, etc. (and the README, etc.)

initializer_range=0.02,
rms_norm_eps=0.00000015625,
use_rms_norm=True,
apply_residual_connection_post_layernorm=False,

is this false for all models? If so, to delete!
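As an editorial illustration of the suggestion (not part of the PR diff; the default values below are hypothetical): if no released GLM checkpoint sets the flag, it can simply be dropped from the config signature instead of being carried as a dead option.

```python
from transformers import PretrainedConfig


class GlmConfig(PretrainedConfig):
    """Hypothetical trimmed config: apply_residual_connection_post_layernorm is removed
    rather than kept as an always-False switch."""

    model_type = "glm"

    def __init__(self, hidden_size=4096, initializer_range=0.02, rms_norm_eps=0.00000015625, **kwargs):
        self.hidden_size = hidden_size
        self.initializer_range = initializer_range
        self.rms_norm_eps = rms_norm_eps
        super().__init__(**kwargs)
```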

        self.mlp = GlmMLP(config)
        self.input_layernorm = (
            GlmRMSNorm(config.hidden_size, eps=config.rms_norm_eps)
            if config.use_rms_norm

check what config uses, but we avoid that in general as well! (code path)
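A minimal sketch of that point (editorial, not the PR code; it assumes released GLM checkpoints always use RMSNorm, and that GlmMLP and GlmRMSNorm are defined in the same modeling file): instantiate the norm directly rather than selecting it through a config flag.

```python
import torch.nn as nn


class GlmDecoderLayer(nn.Module):
    def __init__(self, config, layer_idx: int):
        super().__init__()
        self.mlp = GlmMLP(config)
        # no `if config.use_rms_norm` code path: the norm type is fixed by the architecture
        self.input_layernorm = GlmRMSNorm(config.hidden_size, eps=config.rms_norm_eps)
        self.post_attention_layernorm = GlmRMSNorm(config.hidden_size, eps=config.rms_norm_eps)
```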

"""

hidden_states_after_norm = self.input_layernorm(hidden_states)
residual = hidden_states_after_norm if self.apply_residual_connection_post_layernorm else hidden_states

same here! check if any released models have both
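For reference, a fragment-level sketch (editorial, attention call simplified) of what the block collapses to under the standard pre-norm convention, i.e. if no released checkpoint ever sets apply_residual_connection_post_layernorm:

```python
# sketch only (inside GlmDecoderLayer.forward): standard pre-norm residual path
residual = hidden_states
hidden_states = self.input_layernorm(hidden_states)
hidden_states = self.self_attn(hidden_states, attention_mask=attention_mask)  # attention sub-block, simplified
hidden_states = residual + hidden_states
```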

        self.layers = nn.ModuleList(
            [GlmDecoderLayer(config, layer_idx) for layer_idx in range(config.num_hidden_layers)]
        )
        if config.post_layer_norm:

same here
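Sketch of the same simplification at the model level (editorial, not the diff; `self.norm` is an assumed attribute name): build the final norm unconditionally instead of gating it on `config.post_layer_norm`.

```python
        self.layers = nn.ModuleList(
            [GlmDecoderLayer(config, layer_idx) for layer_idx in range(config.num_hidden_layers)]
        )
        # no `if config.post_layer_norm` gate: the final norm always exists
        self.norm = GlmRMSNorm(config.hidden_size, eps=config.rms_norm_eps)
```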


ArthurZucker and others added 26 commits September 30, 2024 16:03
* HQQ model serialization attempt

* fix hqq dispatch and unexpected keys

* style

* remove check_old_param

* revert to check HQQLinear in quantizer_hqq.py

* revert to check HQQLinear in quantizer_hqq.py

* update HqqConfig default params

* make ci happy

* make ci happy

* revert to HQQLinear check in quantizer_hqq.py

* check hqq_min version 0.2.0

* set axis=1 as default in quantization_config.py

* validate_env with hqq>=0.2.0 version message

* deprecated hqq kwargs message

* make ci happy

* remove run_expected_keys_check hack + bump to 0.2.1 min hqq version

* fix unexpected_keys hqq update

* add pre_quantized check

* add update_expected_keys to base quantizerr

* ci base.py fix?

* ci base.py fix?

* fix "quantization typo" src/transformers/utils/quantization_config.py

Co-authored-by: Arthur <[email protected]>

* fix post merge

---------

Co-authored-by: Marc Sun <[email protected]>
Co-authored-by: Arthur <[email protected]>
@ArthurZucker (Collaborator) left a comment

Something went wrong with the rebasing/merging, as you have unrelated changes!

}


class GlmDecoderLayer(nn.Module):

This one looks fairly classic, so I would have assumed you don't need the forward (unless the issue is with the names of the layers?)
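For context, a rough sketch of what the modular format makes possible here (editorial; the Llama parent class is an assumption, not something stated in this PR): a classic decoder layer can usually inherit the parent's forward and only replace the sub-modules that differ.

```python
from transformers.models.llama.modeling_llama import LlamaDecoderLayer


class GlmDecoderLayer(LlamaDecoderLayer):
    def __init__(self, config, layer_idx: int):
        super().__init__(config, layer_idx)
        # only swap in the GLM-specific MLP; the inherited forward is reused as-is
        self.mlp = GlmMLP(config)
```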

@Cyrilvallez
Member Author

> Something went wrong with the rebasing/merging, as you have unrelated changes!

Yes, currently looking at it

@ArthurZucker
Collaborator

Superseded by #33823
