I transform hundreds of tabular sources. For cleaning and transformation, I found that only a small number of transformation types is required, and that we need to review them as a team, including the business owners. So I wrote a simple, very English-like grammar that gets translated into Polars operations under the covers in Python. It covers 98%+ of my ingestion needs, and it means we can focus as a team on the logic of the data transformations. Business users can easily make changes for the sources they manage.
One of the concepts is a “map” from old values to new values. We keep those in Excel, in Git, so that business users can edit and maintain them. Because it’s Excel, we’re careful to validate those rules on import at each run, mainly flagging where there’s been a lot of change so we can spot anything unintended. Excel makes me nervous in data processing work in general (exploration with Pivots is great, though I’ve moved to VisiData as my first tool of choice). But over years of running this way we’ve worked around Excel’s lax approach to data, such as its habit of interpreting numerical ID fields as numbers rather than strings.
For output “rendering”, because everything is in Polars, we can usually just output to CSV. We use Jinja for some funky cases.
I think most of these are extremely poor. They can only be interpreted in many cases if you already understand the data, such as by reading the table first.
Sure, but it’d be a lot more interesting and challenging to build 100 visualizations where each gives a unique insight into the same dataset. An isometric 3D bar chart is just going through the motions.
From my POV this is worth bookmarking - there are many datasets that are much clearer with one chart type or another - having 100 styles with the same data will later offer a visual index to help me decide what will best serve my needs.
My thoughts exactly! At least half of these are chart types that I've never seen before or at least would never think of using so having this reference is awesome.
I write a lot of documentation, knowing that it may be nobody else who reads it. Why? Because when I take the time to write clearly, I think clearly. It’s for my productivity and effectiveness, first.
We’ve been developing niche medical software successfully for some decades.
First, it helps that it’s niche: it avoids the “make healthcare better with electronic health records” space, which can only descend into putting a mush of text boxes on a screen and promising that AI will do… something…
Second, we listen to our clients and probe their needs. But we’re most successful when we observe our clients. When we’re not in the thick of it, we have more space to ask “does it have to be this way?” We work very hard to formulate the problem so that a piece of software is not the default solution.
Few of the pain points are “exciting” or “glamorous”. But anything that means the practitioner is spending more time with the patient is a big win, even if it means applying some very boring technology.
Good fun. I think, though, that precision in language might be a challenge. I concur with some of the previous comments. Over and above those, I was very strict about not inferring anything beyond the minimum of what was said. For example, “I was in the bedroom from 10:00 to 10:15” does not imply that I was not in the bedroom before or after that time. And “I didn’t see anyone when I arrived” only means I saw no one in the destination room, not that there wasn’t someone in the kitchen (which I must have walked through) or the source room. It’s also illogical that the murder could have happened as late as 11:15, exactly when the police arrive (unless the victim phoned it in). These rules left ambiguity.
Thanks for the feedback! I agree that the rules and explanations need to be improved. I don't like the ambiguity, and I think the deductions you made are indeed correct. I should focus on improving the language in general, since English is not my native tongue.