reduce maxTokens for glm-4-9b-chat to fit 50GB GPU#47
Open
nicole-lihui wants to merge 1 commit intoBaizeAI:mainfrom
Open
reduce maxTokens for glm-4-9b-chat to fit 50GB GPU#47nicole-lihui wants to merge 1 commit intoBaizeAI:mainfrom
nicole-lihui wants to merge 1 commit intoBaizeAI:mainfrom