[Confinement] Week 2

As everyone settles down in the new mode of operations, the number of small tasks has increased and the number of big projects has decreased.

The plagiarism tool is in testing among some of the teachers at school, and the funny reaction of my team of developers asking for an API (to avoid going through the web front end that I crafted - probably badly - in React) made me smile.

What fascinates me overall is the inability of "the web" to cope with the sudden influx of having a ton more people working from home. "They" said the web would replace everything, that it was just a matter of scaling up.

Azure seems to be full, GCloud has some issues with the data traffic, AWS is holding but the status page keep showing outages...

Don't get me wrong, I've been working remote for close to 20 years, so I'm not saying office work is better. But I have been working on projects with people who said it didn't matter if the performance was poor, because they'd just order a bigger server or two.

That inability to take into account the physical constraints of our world is one of the things that grind my gears the most: I've been working on embedded software and high-performance backend stuff for a long time, and betting on poor code hygiene to be compensated by someone else is never a good bet. It ends up with re-writing the code again, and again, and again.

When it's not the RAM issues (lookin at you Electron), it's server constraints (oh the surprise when your instance autoscales up), or bandwidth issues (our government is thinking of restricting the use of Netflix and the like 🙄).

This situation will hopefully remove the attention from the people who can talk and present the best, and back onto more objective metrics (aka "does it work under load?")

I don't rent a small server because I'm cheap. I do it because I can't release any software that doesn't work "correctly" on the bare minimum of specs I have decided the users will have. Then again, when it explodes, it's a great opportunity to learn new things about optimization and constraints 🧐

Since the situation will last a while longer, I hope it reminds everyone that what we do isn't magic. It's science, and we can't wave the problems and constraints away.


Random Wednesday

I had absolutely no idea that /dev/random was so controversial

> That's all good and nice, but even the man page for /dev/(u)random contradicts you! Does anyonewho knows about this stuff actually agree with you?

No, it really doesn't. It seems to imply that /dev/urandom is insecure for cryptographic use, unless you really understand all that cryptographic jargon.

Sick burn

From Myths about /dev/urandom


[Confinement] Week 1

Busy busy week, although not really that hard for people who are used to freelance - and working from home.

Confinement is currently imposed by our government, and even though you hear here and there that it's an affront and whatever, make no mistake: we are testing our social bond by doing this. I am not at risk. Most of my friends are not at risk. We catch COVID-19, we run a fever, and we heal.

Near me, people at risk include my grand father, whose health is fluctuating, and my niece who will soon turn 3. We do it for them, not for ourselves. Now, if they caught it, a decently stocked hospital could save them. But here's the kicker: there are only so many beds in hospitals, and I don't want to put the people working there in the position of deciding whether or not my grandpa or my niece deserve to live, compared to someone else's grandpa or niece.

So we confine ourselves to slow the infection rate, so that hospitals could cope if my loved ones catch that virus, and so that people there don't have to choose who lives or dies. It's that simple really.

Now, it does present some challenges: all the classes at school must be remote-taught and you know how well that goes. Despite the fact that they save a ton of transit time, students don't work more. I'm not going to go on a rant about who students should be working for (hint: themselves), but I will let you imagine how much more energy is needed to keep them focused on learning rather than, say, watching videos online or playing games.

Of course, that means working on new means of coercion 😇

And so, I decided to put money where my mouth is, and since I teach some of them how to develop a backend system with Kitura, I build a plagiarism detector for the work they hand in. Everyone who knows me will tell you how much I hate web front development, but hey, confined means more time to learn new tricks.

Technologies used:

  • REDACTED: for plagiarism detection, CLI tool
  • Kitura: for backend development, resource management and rendering
  • PostgreSQL: for session management, data storage and retreival, in conjunction with Kuery
  • React: for front end development
Things learned from REDACTED

Plagiarism detection is hard. It is absolutely not about diffing files, but is more about turning the documents into a tree-like structure and comparing the branches and the leaves.

It's slow and inefficient, and it's probably not going to work against smart plagiarists. But hey, we'll see how many of my students read my blog 😜

Things learned from Kitura

I actually know Kitura quite well, having worked with it (and taught it) pretty much since its inception. I know some of its "competitors" (if such a term could be applied to open-source frameworks) quite well as well, but this one is the cleanest I know. It has very few dependencies, idiosyncrasies and pitfalls.

Even though IBM has decided to stop contributing in an official capacity, I still think it's the best fit for me, and that somewhat large project has comforrted me in that belief.

Things learned from PostgreSQL

It's the best open-source database software, bar none. Don't change postgres, I've been with you for 20 years, and I still love you very much.

Things learned from React

Where to start? I don't like the way modern frontend dev works. To my old eyes, they seem to be reinventing stuff we always had, in a worse way.

The component based approach is definitely better than doing plain HTML/CSS/JS, but it's so... heavy. There are hundreds if not thousands of dependencies, none of which I can audit for performance or security in a finite amount of time, and it's all running on an engine that's millions of not billions of lines of code. It's just too big to grasp, and too hard to debug.

It works, don't get me wrong, and when you do manage to get it to display things the way you want, it can do marvelous things... I am not dissing the web, per se, just lamenting that it emphasized the two things that I ask my students to avoid doing at all costs:

  • copying stuff you don't understand
  • relying too much of third party dependencies

But hey, got drag and drop upload, session history, and visual code comparison working, so I guess there's that. It was also the #1 time sink on this project.

Infrastructure

I decided to host and deploy that thing on my own server to test my chops as a sysop. Way back when, I contributed to open-source OSes, and I've always fancied myself above average when it comes to server management. That being said, I've had no formal training in the matter and I know I do some... ad-hoc stuff.

Managing a multi-site server has come a long way with Docker and docker-compose, but it's still not 100% easy. Especially if you need to add SSL certificates, which I do, because I'd like the contents of the files to have a modicum of security in transit.

If you need to have a docker + nginx proxy + let's encrypt certificates, I strongly suggest reading this documentation which will help you tons.

Can I Haz Ze Softwarez?

Only if you ask nicely. This is not going open-source for now, but I can offer it to other teachers/schools who face the same situation as me. Reach out and let's talk.


[HoledRange] Sequences and Transformations

HoledRange has reached v 0.1.3 with two additions:

  • Iterating over ranges (contributed by Hugo a.k.a. Zyigh)
  • Transforming ranges into other ranges
Transformations

Simple idea, full of tricks: how do I apply a function f to transform a domain?

  • It has to result to a possible bound value for the domain (ie Hashable & Comparable)
  • We need to check the ordering of the new values in case the transformation flips the sign (eg [0;1] -> [-1;0])
  • We need to optimize the storage as we go in case the transformation collapses holes


[ML] The Nature of Intelligence

From The Gradient:

Current systems can regurgitate knowledge, but they can't really  understand in a developing story, who did what to whom, where, when, and  why; they have no real sense of time, or place, or causality.

The article itself is fascinating, and highlights quite accurately what one of the most advanced deep learning system out there can do in regards to language. Definitely a must read, whether you're convinced that we're a few epochs away from having Her-level AIs or that we are still worlds away.

As for me, every time someone talks about AI, I tend to respond "we can't reliably define intelligence... How could we create artificial intelligence?"