from what i’ve tried it works but error messages are pretty off and the type checking is hard to get around without a bunch of extra calls does llvm_instrinsic work?

Jack Clayton•16mo ago

Potentially, you could experiment with https://docs.nvidia.com/cuda/nvvm-ir-spec/index.html

1. Introduction — NVVM IR Specification 12.3 documentation

Reference guide to the NVVM compiler IR (intermediate representation) based on the LLVM IR.

Jack Clayton•16mo ago

Great blog post by the way!

bennyOP•16mo ago

EXACTLY what I was looking for, thanks Jack :) thank you! Would you be able to share how you would call something like fsub? I have tried llvm.fsub, llvm.fsub.f32, llvm.operations.binary.fsub, etc, nothing seems to be working

Jack Clayton•16mo ago

Perhaps llvm.nvvm.fsub, not sure if it'll work for you though

bennyOP•16mo ago

What about llvm.vp.fsub? llvm.nvvm.fsub says not found, but im getting this issue with fneg let neg_x = llvm_intrinsic["llvm.vp.fneg", SIMD[DType.float32, nelts]](x, SIMD[DType.bool, nelts].splat(True), nelts)

call intrinsic signature float (float, i1, i64) to overloaded intrinsic "llvm.vp.fneg" does not match any of the overloads

I assume this is an issue with SIMD, but I am not sure Okay, i’ve gotten it working but i’m having a few issues. 1. it is slower than the native implementation cpu (this could be a issue with the code structure though) 2. the most basic operations like fneg, sub, add, are all not prefixed with llvm, therefore they are blocked by the llvm_intrinsic command, is there an alternative?

Jack Clayton•16mo ago

Not that I'm aware of, I haven't ventured down that path yet. A GPU related module for Mojo is coming in the future though, so you can use the language itself. It's just not ready yet.

bennyOP•16mo ago

Perfect Jack, thanks 🔥

Gaming

Programming

Tutorial / Example of targetting GPU with llvm_instrinsic or __mlir_op etc?

Did you find this page helpful?