Beta test Llama3 serving and GUI on MAX Nightly

You can beta test the upcoming MAX serving and Llama3 chatbot GUI using the nightly release; I'd love feedback if you run into any problems. Start the GUI by running these commands on arm64 macOS or x86/arm64 Linux (note: the smallest Llama3 model is 4.5GB):
rm -rf ~/.modular
curl -s https://get.modular.com | sh -
modular auth
modular install nightly/max
MAX_NIGHTLY_PATH=$(modular config max-nightly.path)
SHELL_RC=~/.$(basename "$SHELL")rc
echo 'export MODULAR_HOME="'$HOME'/.modular"' >> $SHELL_RC
echo 'export PATH="'$MAX_NIGHTLY_PATH'/bin:$PATH"' >> $SHELL_RC
curl -fsSL https://pixi.sh/install.sh | $SHELL
source "$SHELL_RC"
git clone https://github.com/modularml/max.git ~/max
cd ~/max
git checkout nightly
cd examples/gui
pixi run gui
You can also SSH into a machine with VS Code and run the above commands in the terminal; VS Code will forward the port so you can open the GUI in your local browser. Work is being done to vastly improve the experience of getting up and running with MAX, stay tuned for that.
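If you're not relying on VS Code's automatic port forwarding, a plain SSH tunnel works too. A minimal sketch, assuming the Streamlit GUI is on its default port 8501 and user@remote-host is a placeholder for your machine:
# forward the remote Streamlit port to your local machine
ssh -L 8501:localhost:8501 user@remote-host
# then open http://localhost:8501 in your local browser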
Darin Simmons
Darin Simmons • 5mo ago
@Jack Clayton It appears you have permission to the max repo that I do not.
darin@home:~$ git clone [email protected]:modularml/max ~/max
Cloning into '/home/darin/max'...
The authenticity of host 'github.com (140.82.116.3)' can't be established.
ED25519 key fingerprint is SHA256:+DiY3wvvV6TuJJhbpZisF/zLDA0zPMSvHdkr4UvCOqU.
This key is not known by any other names
Are you sure you want to continue connecting (yes/no/[fingerprint])? yes
Warning: Permanently added 'github.com' (ED25519) to the list of known hosts.
[email protected]: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.
Is it the git in [email protected] that I haven't given permissions to, or something else? Ubuntu 22.04, clean install. I simply changed it to git clone https://github.com/modularml/max.git and was able to move on.
Jack Clayton
Jack Clayton OP • 5mo ago
Ok cool, thanks, I'll fix the command. Yes, you need an SSH key set up for that to work. Thanks for raising it.
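For reference, cloning over HTTPS (as Darin did) avoids the SSH key requirement:
# HTTPS clone, no SSH key needed
git clone https://github.com/modularml/max.git ~/max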
Darin Simmons
Darin Simmons • 5mo ago
🦙 Llama3: Select a quantization encoding to download the model from a predefined Model URL. If the model exists at Model Path it won't be downloaded again. You can set a custom Model URL or Model Path that matches the quantization encoding. It does say a 30-minute download time btw, at 2.34MB/s. Is Telstra serving this up from mid-2005? 😉 Of course, a 5GB download would have been half a monthly quota at the time. I lived in Wollongong for a year.
Jack Clayton
Jack Clayton OP • 5mo ago
Yeah, it's an 8B model so 4.5GB, but with Q4_K quantization it's very fast on CPU with high quality outputs. Oh nice, my Dad is from the Gong, very nice area.
Darin Simmons
Darin Simmons • 5mo ago
Yes, we learned to put beets on our chicken burgers 🤯. I went to uni for a year while my wife worked and volunteered and travelled. Anything you want me to try once it downloads?
Jack Clayton
Jack Clayton OP • 5mo ago
You can ask it any question and make sure it's keeping the conversation context. The system prompt down the bottom says it's a coding assistant; you can change that without rebuilding the model.
Darin Simmons
Darin Simmons • 5mo ago
It's Llama doing the Llama thing. 🦙 I asked it for some Mojo code and it suggested import parallel and used let, but it's responding and working just fine. I'm asking about going to San Francisco now with a different context. I tried "When is the best day and time to travel into San Francisco?" and it started giving a very general answer. I asked another question before it was done, and it appears to be hung. Going out to dinner, will try some more when I get back.
Jack Clayton
Jack Clayton OP • 5mo ago
Cool, thanks for the feedback. I'll make it so it locks sending another message until the current stream is finished or cancelled, like ChatGPT. Much appreciated.
Darin Simmons
Darin Simmons • 5mo ago
I stopped it via the button on the upper right, chose q6_k, and it started the download process. I went to dinner and came back to...
Darin Simmons
Darin Simmons • 5mo ago
Killing the app and restarting worked fine (defaulted to q4). I switched it to q6_k and it built fine and then (eventually) started. Old AMD Ryzen 3 1700, 64GB RAM.
Jack Clayton
Jack Clayton OP • 5mo ago
Thanks for that cheers
Darin Simmons
Darin Simmons • 5mo ago
rm -rf'd the whole thing, got back to q4, flipped it to q6, and it loaded fine and then ran fine.
Martin Dudek
Martin Dudek • 5mo ago
Installation went without issues on my M2 MacBook Pro, thanks for sharing this app. Works well except for the issue @Darin Simmons already mentioned. A cancel button would be great.
Jack Clayton
Jack Clayton OP • 5mo ago
Awesome thanks for the report
Darin Simmons
Darin Simmons • 4mo ago
I tried a Frankenstein of magic and git and this (great), but it didn't work. First it complained that there is no name field in the modular/max pyproject.toml, so I added name = "max" and it appeared to work, but then it crashed for not finding max. I wasn't surprised that it didn't work... But then I went to verify that the clean original is working, and it is not working either. Streamlit fires, the UI works, but choosing a model returns an error about not finding max.
darin@home:~/max/examples/gui$ pixi run gui
✨ Pixi task (gui): streamlit run home.py

You can now view your Streamlit app in your browser.

Local URL: http://localhost:8501
Network URL: http://192.168.86.35:8501

2024-08-01 09:21:44.103 Uncaught app exception
Traceback (most recent call last):
File "/home/darin/max/examples/gui/.pixi/envs/default/lib/python3.11/site-packages/streamlit/runtime/scriptrunner/exec_code.py", line 75, in exec_func_with_error_handling
result = func()
^^^^^^
File "/home/darin/max/examples/gui/.pixi/envs/default/lib/python3.11/site-packages/streamlit/runtime/scriptrunner/script_runner.py", line 574, in code_to_exec
exec(code, module.__dict__)
File "/home/darin/max/examples/gui/pages/bert.py", line 21, in <module>
from max import engine
ModuleNotFoundError: No module named 'max'
Should this module be working? I drank coffee, removed the idiocy from my brain, installed max, confirmed max, and it returns the same error.
Jack Clayton
Jack Clayton OP • 4mo ago
Do pixi run setup first and it should find your MAX installation.
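In other words, from the GUI directory, something like:
cd ~/max/examples/gui
# locate your MAX installation (the setup task runs find-max.py)
pixi run setup
# then launch the Streamlit app
pixi run gui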
Darin Simmons
Darin Simmons • 4mo ago
It returns "✨ Pixi task (setup): python find-max.py"; do I need to run python find-max.py separately? It works without running it. Thanks for the help. I'll try the same in the magic project and see if it fires off as well.
Jack Clayton
Jack Clayton OP • 4mo ago
Cheers. When we have magic nightly releases going, you won't have to do the setup step.
sa-code
sa-code • 4mo ago
Just tried this on the latest nightly with nightly mojo & max installed. I've confirmed mojo -v returns mojo 2024.8.1005 (34d14290) and max -v returns max 2024.8.914 (96cec001) Modular version 2024.8.914. However, I still get this error.
sa-code
sa-code • 4mo ago
I also get ModuleNotFoundError: No module named 'max' for StableDiffusion, even though nightly max and nightly mojo are both on the path in my .zshrc and my .zprofile
Jack Clayton
Jack Clayton OP • 4mo ago
For stable diffusion, run pixi run setup first to fix it. For the first one, it should be fixed in the next nightly release. Thanks for raising it.
sa-code
sa-code • 4mo ago
Hm, no luck even after running pixi run setup 😦 I'm not sure if I'm doing something wrong.
Jack Clayton
Jack Clayton OP • 4mo ago
Can you paste your pyproject.toml after running pixi run setup?
Jack Clayton
Jack Clayton OP • 4mo ago
It's not finding the MAX wheels. Have you installed with modular install nightly/max? Do you have MODULAR_HOME set to the correct spot?
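A quick sanity check of both, using the same commands as in the setup steps at the top:
# confirm MODULAR_HOME and where the nightly MAX package resolves to
echo $MODULAR_HOME
modular config max-nightly.path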
sa-code
sa-code • 4mo ago
Confirming that I've just run modular clean and modular install nightly/max. Both my zshrc and zprofile have the following lines:
export MODULAR_HOME="/Users/skapadia/.modular"
export PATH="/Users/skapadia/.modular/pkg/packages.modular.com_nightly_max/bin:$PATH"
export PATH="/Users/skapadia/.modular/pkg/packages.modular.com_nightly_mojo/bin:$PATH"
modular-cli is also up to date at modular 0.9.1 (3460d12e)