It seems to be small language model (2.2 gigabytes in memory) and it runs on GPU (at least on M1 Ultra) predictions are pretty bad compared to LLAMA 2/3 or GPT 3.5.
Here some short video of some minimal swift project with single struct that i asked to free memory of. (It sadly failed I tested in on my bigger projects with a lot of Metal and HPC and it never make usable code completion for my comments. On the other hand it made some usable (?) documentation for my methods inside some structs)
Anyone got some good results with it (some tricks maybe :?)