• 9 Posts
  • 661 Comments
Joined 11 months ago
cake
Cake day: December 18th, 2023

help-circle

  • The “battle” is the result of copyright people trying to use open source people for their ends.

    In the past, for software, the focus was completely on the terms of the license. If you look at OSI’s new definition, you will find no mention of that, despite the fact that common licenses in the AI world are not in line with traditional standards. The big focus is data, because that is what copyright people care about. AI trainers are supposed to provide extensive documentation on training data. That’s exactly the same demand that the copyright lobby managed to get into the european AI Act. They will use that to sue people for piracy.

    Of course, what the copyright people really want is free money. They’re spreading the myth that training data is like source code and training like compiling. That may seem like a harmless, flawed analogy. But the implication is that the people who work and pay to do open source AI have actually done nothing except piracy. If they can convince judges or politicians who don’t understand the implications then this may cause a lot of damage.





  • Thank you. Since we decided a few weeks ago to adopt the leaf as legal tender, we have, of course, all become immensely rich.

    But we have also run into a small inflation problem on account of the high level of leaf availability, which means that, I gather, the current going rate has something like three deciduous forests buying on ship’s peanut.

    So in order to obviate this problem and effectively revalue the leaf, we are about to embark on a massive defoliation campaign, and…er, burn down all the forests. I think you’ll all agree that’s a sensible move under the circumstances.
















  • If the same user can generate the same input, it will result in the same hash.

    Yes, if. I don’t know if you can guarantee that. It’s all fun and games as long as you’re doing English. In other languages, you get characters that can be encoded in more than 1 way. User at home has a localized keyboard with a dedicated key for such a character. User travels across the border and has a different language keyboard and uses a different way to create the character. Euro problems.

    https://en.wikipedia.org/wiki/Unicode_equivalence

    Byte length of the character is irrelevant as long as you’re not doing something ridiculous like intentionally parsing your input in binary and blithely assuming that every character must be 8 bits in length.

    There is always some son-of-a-bitch who doesn’t get the word.

    • John F. Kennedy