Okay, so, I’ve been messing around with this thing called LLaMA, a big language model from Meta AI. I wanted to see if I could get it running on my own machine, you know, just to tinker around and see what it could do. But boy, was it a journey!
First, I tried to get the weights for the model. That was a whole thing in itself. I put in a request through Meta’s official form, and it turned out to be surprisingly hard to get access. It was like trying to get into some exclusive club: they wanted me to fill out all these forms and explain why I needed it. I guess they’re pretty careful about who gets their hands on this stuff.
So after waiting for what felt like forever, I finally got approved. Then came the downloading part. Oh man, the files were huge! We’re talking gigabytes and gigabytes of data. It took hours to download everything, and my internet was not happy. I think I nearly used up my entire data allowance for the month.
Then I thought, “Okay, I’ve got the files, now I just need to run them.” Ha! That was wishful thinking. Turns out, you can’t just double-click and run these things. I had to set up a whole environment with Python and a bunch of other tools I’d barely heard of, and I spent days trying to figure it all out.
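For anyone heading down the same road, here’s roughly the kind of sanity check I wish I had run on day one, before touching the model at all. This is just a minimal sketch assuming a PyTorch-based setup (which the LLaMA code expects); your versions, GPU, and tooling will almost certainly differ.

```python
# Quick environment sanity check before touching the model itself.
# Assumes a PyTorch-based setup, which the LLaMA inference code expects.
import sys

print(f"Python: {sys.version.split()[0]}")  # a reasonably recent Python 3 is needed

try:
    import torch
    print(f"PyTorch: {torch.__version__}")
    print(f"CUDA available: {torch.cuda.is_available()}")
    if torch.cuda.is_available():
        props = torch.cuda.get_device_properties(0)
        print(f"GPU: {torch.cuda.get_device_name(0)}")
        # Rough VRAM check: even the smallest model in half precision wants a lot of memory.
        print(f"VRAM: {props.total_memory / 1e9:.1f} GB")
except ImportError:
    print("PyTorch is not installed yet - that's the first dependency to sort out.")
```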
And then the dependencies! Don’t even get me started on dependencies. It was a never-ending chain of things I needed to install. Every time I thought I had everything, I’d run a command and boom! Another error message telling me I was missing some obscure library. I swear I spent more time searching for error messages than actually getting the model to run.
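Eventually I stopped waiting for the next crash and just checked what was still missing up front. Something like the snippet below is all it takes; note that the package list here reflects the route I ended up on (the Hugging Face one), so treat it as an example rather than a definitive list.

```python
# Check which of the packages I kept tripping over are actually importable.
# The list reflects the Hugging Face route I took; yours may differ.
import importlib

packages = ["torch", "transformers", "sentencepiece", "accelerate"]

missing = []
for name in packages:
    try:
        importlib.import_module(name)
        print(f"ok      {name}")
    except ImportError:
        print(f"MISSING {name}")
        missing.append(name)

if missing:
    print("\nStill need: pip install " + " ".join(missing))
```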
Finally, after countless hours of frustration and staring at my screen, I managed to get it running, at least a little bit. But it was so slow! It took forever to generate even a few sentences. I tried all sorts of things to speed it up, tweaking settings and optimizing where I could, but it was still sluggish. It was like my computer was saying to me, “You really thought I could handle that?”. But I finally got it working. It’s not perfect, but I did it! Here are the important steps of the process:
- Requesting the Model: it took several attempts before I finally got approved.
- Downloading the Weights: the files were huge enough to make my computer (and my data plan) cry.
- Setting Up the Environment: getting Python and the rest of the tooling in place.
- Installing Dependencies: easily the biggest time sink of the whole process.
- Running the Model: I was so excited when it finally produced output (see the sketch after this list).
- Optimizing Performance: I tried hard to speed it up, but nothing changed all that much.
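And for the “running it” part: after converting the weights to the Hugging Face format, this is more or less the minimal script I ended up with. The local model path and the generation settings are just placeholders for whatever you have on disk, so take this as a sketch of the approach rather than the one true way. Loading in half precision was the one “optimization” that made a noticeable difference for me; people also go further with 8-bit quantization, but I never got that far.

```python
# Minimal sketch of loading and running locally converted LLaMA weights
# with Hugging Face Transformers. The model path below is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "./llama-7b-hf"  # wherever your converted weights live

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.float16,  # half precision roughly halves the memory footprint
    device_map="auto",          # let accelerate place layers on GPU/CPU as space allows
)

prompt = "The hardest part of running a language model at home is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=50, do_sample=True, temperature=0.7)

print(tokenizer.decode(output[0], skip_special_tokens=True))
```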
So, yeah, that’s my story of trying to run LLaMA. It was a pain, to be honest, a real headache. But I learned a lot, and it was kind of satisfying to finally get it working, even if it wasn’t perfect. Would I do it again? Probably not. But hey, at least I can say I tried, right? I also took a ton of notes throughout the process. They were a mess, but they helped me a lot.