Beta test Llama3 serving and GUI on MAX Nightly

You can beta test the upcoming MAX serving and Llama3 chatbot GUI using the nightly release; I'd love feedback if you run into any problems. Start the GUI by running these commands on arm64 macOS or x86/arm64 Linux (note that the smallest Llama3 model is a 4.5GB download):
rm -rf ~/.modular
curl -s https://get.modular.com | sh -
modular auth
modular install nightly/max
MAX_NIGHTLY_PATH=$(modular config max-nightly.path)
SHELL_RC=~/.$(basename "$SHELL")rc
echo 'export MODULAR_HOME="'$HOME'/.modular"' >> $SHELL_RC
echo 'export PATH="'$MAX_NIGHTLY_PATH'/bin:$PATH"' >> $SHELL_RC
curl -fsSL https://pixi.sh/install.sh | $SHELL
source "$SHELL_RC"
git clone https://github.com/modularml/max.git ~/max
cd ~/max
git checkout nightly
cd examples/gui
pixi run gui
You can also SSH into a machine with VS Code and run the above commands in the terminal; VS Code will forward ports so you can open the GUI in your local browser. Work is underway to vastly improve the experience of getting up and running with MAX, so stay tuned.
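For anyone connecting over plain SSH rather than VS Code, the forwarding can be done by hand. This is a sketch only: the host names are placeholders, and the port is an assumption (8501 is Streamlit's default, which this kind of GUI commonly uses), so check the address `pixi run gui` prints and adjust accordingly.

```
# ~/.ssh/config entry (host and user are placeholders); forwards the
# GUI's port (8501 assumed) from the remote machine to your laptop.
Host max-gui
    HostName your.remote.host
    User your-user
    LocalForward 8501 localhost:8501
```

With that in place, `ssh max-gui` opens the tunnel and the GUI should be reachable at http://localhost:8501. The one-off equivalent is `ssh -L 8501:localhost:8501 your-user@your.remote.host`.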
Darin Simmons • 3mo ago
@Jack Clayton It appears you have permission to the max repo that I do not.
darin@home:~$ git clone git@github.com:modularml/max ~/max
Cloning into '/home/darin/max'...
The authenticity of host 'github.com (140.82.116.3)' can't be established.
ED25519 key fingerprint is SHA256:+DiY3wvvV6TuJJhbpZisF/zLDA0zPMSvHdkr4UvCOqU.
This key is not known by any other names
Are you sure you want to continue connecting (yes/no/[fingerprint])? yes
Warning: Permanently added 'github.com' (ED25519) to the list of known hosts.
git@github.com: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.
Is it the git in the git@github.com that I haven't given permissions, or something else? Ubuntu 22.04, clean install. I simply changed it to git clone https://github.com/modularml/max.git and was able to move on.
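Darin's HTTPS workaround can also be applied to an existing clone without re-cloning, by repointing the remote. A minimal sketch, demonstrated on a throwaway repo so nothing touches a real checkout:

```shell
# If a clone made with the SSH URL hits "Permission denied (publickey)",
# point its remote at the HTTPS URL instead of deleting and re-cloning.
demo=$(mktemp -d)
git init -q "$demo"
cd "$demo"
git remote add origin git@github.com:modularml/max
git remote set-url origin https://github.com/modularml/max.git
git remote get-url origin   # prints https://github.com/modularml/max.git
```

In a real checkout you would just run the `git remote set-url` line from inside ~/max.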
Jack Clayton • 3mo ago
Ok cool, thanks, I'll fix the command. Yes, you need an SSH key set up for that to work; thanks for raising it.
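For anyone who does want the SSH URL to work, a rough sketch of setting up a GitHub key. The filename and email below are illustrative, not required names:

```shell
# Generate an ed25519 key pair and print the public half to add at
# GitHub -> Settings -> SSH and GPG keys. A non-default filename is
# used here so nothing existing gets overwritten.
mkdir -p ~/.ssh
ssh-keygen -t ed25519 -C "you@example.com" -f ~/.ssh/id_ed25519_github -N ""
cat ~/.ssh/id_ed25519_github.pub
# After adding the printed key on GitHub, verify the connection with:
#   ssh -T git@github.com
```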
Darin Simmons • 3mo ago
🦙 Llama3: Select a quantization encoding to download the model from a predefined Model URL. If the model exists at Model Path, it won't be downloaded again. You can set a custom Model URL or Model Path that matches the quantization encoding. It does say a 30-minute download time btw, at 2.34MB/s. Is Telstra serving this up from mid-2005? 😉 Of course, a 5GB download would have been half a monthly quota at the time. I lived in Wollongong for a year.
Jack Clayton • 3mo ago
Yeah, it's an 8B model so 4.5GB, but with Q4_K it's very fast on CPU with high-quality outputs. Oh nice, my Dad is from the Gong, very nice area.
Darin Simmons • 3mo ago
Yes, we learned to put beets on our chicken burgers 🤯. I went to uni for a year while my wife worked, volunteered, and travelled. Anything you want me to try once it downloads?
Jack Clayton • 3mo ago
You can ask it any question and make sure it's keeping context for the conversation. The system prompt down the bottom says it's a coding assistant; you can change that without rebuilding the model.
Darin Simmons • 3mo ago
It's llama doing the llama thing. 🦙 I asked it for some Mojo code and it suggested import parallel and used let, but it's responding and working just fine. I'm asking about going to San Francisco now with a different context. I tried "When is the best day and time to travel into San Francisco?" and it started giving a very general answer. I asked another question before it was done, and it appears to be hung. Going out to dinner, will try some more when I get back.
Jack Clayton • 3mo ago
Cool, thanks for the feedback. I'll make it lock sending another message until the current stream is finished or cancelled, like ChatGPT. Much appreciated.
Darin Simmons • 3mo ago
I stopped it via the button in the upper right, chose q6_k, and it started the download process; I went to dinner and came back to...
Darin Simmons • 3mo ago
Killing the app and restarting worked fine (it defaulted to q4); I switched it to q6_k and it built fine and then (eventually) started. Old AMD Ryzen 3 1700, 64GB RAM.
Jack Clayton • 3mo ago
Thanks for that, cheers.
Darin Simmons • 3mo ago
rm -rf'd the whole thing, got back to q4, flipped it to q6, and it loaded fine and then ran fine.
Martin Dudek • 3mo ago
Installation went without issues on my M2 MacBook Pro, thanks for sharing this app. It works well except for the issue @Darin Simmons already mentioned. A cancel button would be great.
Jack Clayton • 3mo ago
Awesome, thanks for the report.