Among arXiv publications there are 217 results that contain "large language model" in the full text and "from scratch" in the title or abstract.
There are 2873 results that contain "large language model" in the full text and use "pretrained" in the title or abstract. A more than 10x difference in publication count does make one approach feel more common than the other.
I'd need to get into more involved queries to break down the semantic categories of those papers.
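For anyone who wants to poke at those counts themselves, here is a minimal sketch against the public arXiv API. Two caveats: the API searches metadata (title, abstract, etc.), not full text, so it can only approximate part of the comparison above, and the helper name and phrases are just illustrative.

    # Minimal sketch: count arXiv results whose title or abstract contains a
    # phrase, via the public arXiv API (Atom feed). Metadata only, no full text.
    import re
    import urllib.parse
    import urllib.request

    def count_arxiv_results(phrase: str) -> int:
        params = urllib.parse.urlencode({
            "search_query": f'ti:"{phrase}" OR abs:"{phrase}"',
            "max_results": 1,  # we only need the total, not the entries
        })
        url = "http://export.arxiv.org/api/query?" + params
        with urllib.request.urlopen(url) as resp:
            feed = resp.read().decode("utf-8")
        # The Atom response reports the hit count in <opensearch:totalResults>.
        match = re.search(r"<opensearch:totalResults[^>]*>(\d+)<", feed)
        return int(match.group(1)) if match else 0

    print(count_arxiv_results("from scratch"))
    print(count_arxiv_results("pretrained"))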
To add to this, companies at Google-scale tend to have a huge variety of ML-related jobs, ranging from low-level things like optimising libraries for different hardware, to the more general research positions where people are working on their own pet projects. Plus everything in between: data management and curation for training models that get used in production, people who try to figure out how to productionise cutting-edge research, people who build the infrastructure that other ML engineers use (and here again, everything from hardware/server people, cloud, site reliability, tooling) and the list goes on.
I know of at least one person who got an ML job at Google, but didn't apply specifically for it. They had a very strong ML background, applied for a generic software engineering role, and got team matched. That seems like a reasonable way to go if you don't want to go through a research interview loop.
I'd like to echo this. I learned a long time ago that I don't want to be a "machine learning engineer": I have no interest in designing new networks, feature selection, or training as a daily job. I know how to do all those things, but it's not something I pursued at Google. Instead, I found jobs where I could work with those people (often the ones doing the real state-of-the-art research at scale) using my experience in ML, data engineering, pipelines, and HPC.
There is nothing quite like having a world-class researcher ask you to figure out why their model is exploding, tracking down the crazy things that happen on TPUs when their math isn't absolutely perfect, helping them fix it, and then seeing them publish their results (or put them in prod). Or knowing enough software and hardware to debug a TensorFlow TPU problem with an oscilloscope connected to the voltage regulator in a hardware lab.
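(For readers wondering what "figuring out why a model is exploding" often starts with, here is a generic, framework-agnostic sketch: scan a training step's gradients for NaNs/Infs and outsized norms. The `grads` dict and the threshold are assumptions for illustration, not anyone's actual tooling.)

    import numpy as np

    def audit_gradients(grads: dict, norm_threshold: float = 1e3) -> list:
        """Return (param_name, issue) pairs worth a closer look."""
        problems = []
        for name, g in grads.items():
            g = np.asarray(g)
            if not np.all(np.isfinite(g)):
                problems.append((name, "contains NaN or Inf"))
            else:
                norm = float(np.linalg.norm(g))
                if norm > norm_threshold:
                    problems.append((name, f"unusually large norm {norm:.3e}"))
        return problems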
Personally, I gained these skills over a long period starting in the mid-90s (working on machine learning, then later HPC for biology, and ultimately back to machine learning). But I am a slow learner. Probably the shortest path is to get accepted to a major university and do really well in your ML and CS classes, then parlay that into a job at a FAAMG, then figure out what you want to do with all your skillz.
I got a unicorn senior RS offer, without a PhD, from a company staffed mostly by former FAANG top brass, after interviewing without knowing it was for RS; I thought it was DS/ML. I declined because of "culture fit": everyone there has a PhD, they assumed I had one, and I don't even have a bachelor's. We still hang out and laugh about it.
I had been working in Attitude Determination and Control and Optical Systems Engineering for seven years before that interview, and I just, like, knew the stuff from the job. I've been back in pure-SWE roles for four years already and I don't think I could do it now. I have the intuition, but I couldn't whiteboard proofs for tree-based algos and manipulate integrals the way I did in that interview, for sure.
It's not impossible for a deal to happen that fast. A SAFE takes almost no lawyer time to prepare, and sometimes investors think there's good reason to move quickly. But the fact that it does happen sometimes doesn't mean it's "normal" in any way. Even great people with really solid ideas usually take weeks to put together and close a deal like that.
A "3 day raise" can mean many things. Imagine something like this: 2-3 strategic angel investors are already in place. Prior to "starting fundraising" there is a month-long period of pitch discovery during which your angels introduce you to investors, but you are "not fundraising yet". During this period investors start asking to invest, but you are "not fundraising yet". Once you hit something like $500k in interest, you email all the investors you're already talking to and say "I'm fundraising now and already have $500k in interest for a $1.5M round", and one seed fund takes the remaining $1M and you're done (the three days are for diligence). Or four famous angels follow with $250k checks and you're done. It's an orchestrated process.
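(The arithmetic of that example, spelled out as a toy tally; the figures just mirror the comment and are purely illustrative.)

    # Toy tally of the orchestrated round described above (illustrative numbers).
    round_target = 1_500_000

    # Soft-circled interest gathered while "not fundraising yet".
    interest = 500_000

    # Option A: one seed fund takes the rest of the round.
    seed_fund_check = round_target - interest   # $1,000,000

    # Option B: four famous angels follow on with $250k checks.
    angel_checks = 4 * 250_000                  # $1,000,000

    assert interest + seed_fund_check == round_target
    assert interest + angel_checks == round_target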
You can export, yes; I'm not entirely sure if you can export directly to Anki.
You can export in either txt or json.
And you can export 3 ways.
1. All highlights & notes from an article (article = epub / web article)
2. All highlights & notes from a particular topic/tag
3. All highlights & notes from your entire account organized by article.
I think that's how it works. I wrote the export code a while ago, so the details escape me at the moment.
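(On the Anki question above: even if there's no direct Anki export, a txt/json export can usually be massaged into something Anki will import. A minimal sketch, assuming a hypothetical JSON layout with "highlights" entries carrying "text"/"note"/"article" fields; the real export format may differ.)

    import csv
    import json

    def highlights_json_to_anki_tsv(json_path: str, tsv_path: str) -> None:
        """Convert an exported highlights JSON into a TSV that Anki can import
        (File -> Import, tab-separated, front/back fields)."""
        with open(json_path, encoding="utf-8") as f:
            data = json.load(f)
        with open(tsv_path, "w", newline="", encoding="utf-8") as f:
            writer = csv.writer(f, delimiter="\t")
            for item in data.get("highlights", []):
                front = item.get("text", "")
                back = item.get("note", "") or item.get("article", "")
                writer.writerow([front, back])

    # highlights_json_to_anki_tsv("export.json", "anki_cards.tsv")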
You mean to say someone like NSO Group, not Cellebrite. But you should know that it's possible that driving up the price of bugs helps companies like NSO rather than hurting them. They're middlemen, taking a cut of the value of transactions between exploit developers and downstream customers. Those downstream customers, for shops like NSO, are overwhelmingly government agencies that aren't especially price-sensitive to the cost of individual bugs.
I assume NSO Group operates in their own best interest. If buying a bug and reselling it hurts them, then I think they won't do it.
Although I guess one reason they might buy a bug at a financial loss is to prevent a competitor from getting it, since letting a competitor have it might cause even worse financial harm.
Shameless plug: I've made a deep learning course with practical content covering a wide variety of computer vision topics --> https://arthurdouillard.com/deepcourse/
I'll look at this properly when I'm not on mobile, but I noticed some minor issues. A typo that seems to be repeated a few times: "space-repetition" should be "spaced-repetition". There are also several unnecessary capitals in your opening sentence.