Skip to content
Snippets Groups Projects
  1. Oct 03, 2024
  2. Jun 18, 2024
  3. Feb 20, 2024
  4. Feb 19, 2024
  5. Feb 16, 2024
  6. Feb 14, 2024
  7. Feb 13, 2024
  8. Feb 12, 2024
  9. Feb 08, 2024
    • Philipp's avatar
      Preview/hf backend refactor (#45) · 14581acb
      Philipp authored
      
      * HF backend refactor (#37)
      
      - Model settings are now stored in external JSON model registry
      - Two new prototyping methods to check messages for issues and context limit
      - General Exception to handle exceeding context
      - Gated model (Llama2) handling integrated into single HF backend
      - New and updated documentation
      - use huggingfaces' cache lookup mechanism
      
      Co-authored-by: default avatarJonathan Jordan <knawurzelkopp@hotmail.de>
      Unverified
      14581acb
  10. Feb 06, 2024
  11. Jan 24, 2024
  12. Jan 18, 2024
    • Philipp's avatar
      Feat/refactor model pairing (#35) · 7fc95710
      Philipp authored
      Fixes #34
      
      - now clem automatically assumes self-play when only a single model is given (and the game is multi-player)
      - removed the option to run all games (as this simplifies the code and is now done in a pipeline script)
      - improved cli usage `python3 scripts/cli.py run -g <game> -m <model1> [<model2>]`
      - pipeline scripts and README have been updated (no changes in the games)
      Unverified
      7fc95710
  13. Jan 17, 2024
  14. Jan 12, 2024
  15. Jan 11, 2024
  16. Jan 10, 2024
  17. Jan 08, 2024
    • Jonathan Jordan's avatar
      Hf model updates (#32) · f2deb947
      Jonathan Jordan authored
      * Add openchat_3.5, Yi-34B-Chat, Orca-2-13b model support; Add custom openchat and ChatML chat templates; Add 'slow' tokenizer usage handling
      
      * Add culling of openchat EOS token at end of outputs
      
      * Remove Yi-34B-Chat and Orca-2-13b from supported models list due to issues found in testing
      
      * Add Yi-34B-Chat output processing; Add Yi-34B-Chat to supported models list
      
      * Add DeepSeek 7B/67B, Tulu 7B/70B and Mixtral 8x7B models
      
      * Changed transformers version requirement to 4.36.0
      
      * Changed accelerate version requirement to 0.25.0
      
      * Add output EOS culling for DeepSeek
      
      * Add culling of empty iniitla system message for Mistral models compatibility
      
      * Add SUSTech/SUS-Chat-34B to huggingface_local_api.py
      
      * Add SUSTech/SUS-Chat-34B output EOS culling
      Unverified
      f2deb947
    • Jonathan Jordan's avatar
      DOC: Adding new models to HF backend (#31) · 6b245a8d
      Jonathan Jordan authored
      * Started model checking cli script
      
      * Finish initial_hf_check.py; Add HowTo doc on adding models to the HF backend
      Unverified
      6b245a8d
  18. Dec 21, 2023
    • Jonathan Jordan's avatar
      HF backend model additions: Yi-34B-Chat, Openchat-3.5, Tulu-2, DeepSeek and Mixtral (#29) · fc5a5461
      Jonathan Jordan authored
      * Add openchat_3.5, Yi-34B-Chat, Orca-2-13b model support; Add custom openchat and ChatML chat templates; Add 'slow' tokenizer usage handling
      
      * Add culling of openchat EOS token at end of outputs
      
      * Remove Yi-34B-Chat and Orca-2-13b from supported models list due to issues found in testing
      
      * Add Yi-34B-Chat output processing; Add Yi-34B-Chat to supported models list
      
      * Add DeepSeek 7B/67B, Tulu 7B/70B and Mixtral 8x7B models
      
      * Changed transformers version requirement to 4.36.0
      
      * Changed accelerate version requirement to 0.25.0
      
      * Add output EOS culling for DeepSeek
      Unverified
      fc5a5461
  19. Dec 01, 2023
Loading