Saturday, December 14, 2024

Q&A: Fixing the problem of stale characteristic flags

As we noticed final week with what occurred because of a nasty replace from CrowdStrike, it’s extra clear than ever that corporations releasing software program want a approach to roll again updates if issues go improper. 

Within the most up-to-date episode of our podcast, What the Dev?, we spoke with Konrad Niemiec, founder and CEO of the characteristic flagging software, Lekko, to speak concerning the significance of including characteristic flags to your code, but additionally what can go improper if flags aren’t correctly maintained.

Right here is an edited and abridged model of that dialog:

David Rubinstein, editor-in-chief of SD Occasions: For years we’ve been speaking about characteristic flagging within the context of code experimentation, the place you may launch to a small cohort of individuals. And in the event that they prefer it, you may unfold it out to extra folks, or you may roll it again with out actually doing any harm if it doesn’t work the best way you thought it might. What’s your tackle the entire characteristic flag state of affairs?

Konrad Niemiec, founder and CEO of Lekko: Function flagging is now thought-about the mainstream approach of releasing software program options. So it’s undoubtedly a observe that we wish folks to proceed doing and proceed evangelizing.  

After I was at Uber we used a dynamic configuration software known as Flipper, and I left Uber to a smaller startup known as Sisu, the place we used one of many main characteristic flagging instruments in the marketplace. And after I used that, though it allow us to characteristic flag and it did resolve a bunch of issues for us, we encountered totally different points that resulted in danger and complexity being added to our system. 

So we ended up having a bunch of stale flags littered round our codebase, and issues we wanted to maintain round as a result of the enterprise wanted them. And so we ended up in a state of affairs the place code turned very tough to keep up, and it was very laborious to maintain issues clear. And we simply ended up inflicting points left and proper.

DR: What do you imply by a stale flag?

KN: An implementation of a characteristic flag typically seems to be like an if assertion within the code. It’ll say if characteristic flag is enabled, I’ll do one factor, in any other case, I’ll do the previous model of the code. That is the way it seems to be like whenever you’re really including it as an engineer. And what a stale flag will imply is the flag can be all the best way on. So that you’ll have absolutely rolled it out, however you’re leaving that ‘else’ code path in there. So that you mainly have some code that’s just about by no means going to get run, nevertheless it’s nonetheless sitting in your binaries. And it nearly turns into this zombie. We wish to name them zombie flags, the place it sort of pops up whenever you least count on them. You suppose they’re lifeless, however they arrive again to life.

And this typically occurs in startups which are attempting to maneuver quick. You wish to get options out as quickly as attainable so that you don’t have time to have a flag clear replace and undergo and categorize to see for those who ought to take away all these things from the code. And so they find yourself accumulating and doubtlessly inflicting points due to these stale code paths.

DR: What sort of points?

KN: So a straightforward instance is you’ve got some kind of untested code based mostly on a mixture of characteristic flags. Let’s say you’ve got two characteristic flags which are in the same a part of the code base, so there are actually 4 totally different paths. And if certainly one of them hasn’t been executed shortly, odds are there’s a bug. So one factor that occurred at Sisu was that certainly one of our largest prospects encountered a problem after we mistakenly turned off the improper flag. We thought we had been sort of rolling again a brand new characteristic for them, however we jumped right into a stale code path, and we ended up inflicting a giant problem for that buyer.

DR: Is that one thing that synthetic intelligence may tackle as a approach to undergo the code and counsel eradicating these zombie flags?

KN: With present instruments, it’s a very handbook course of. You’re anticipated to simply undergo and clear issues up your self. And that is precisely what we’re seeing. We predict that generative AI has a giant function to play right here. Proper now we’re beginning off with easy heuristic approaches in addition to some generative AI approaches to determine hey, what are some actually sophisticated code paths right here? Can we flag these and doubtlessly convey these stale code paths down considerably? Can we outline allowable configurations? 

One thing we see as a giant distinction between dynamic configuration and have flagging itself is you could mix totally different flags or totally different items of dynamic conduct within the code collectively as one outlined configuration. And that approach, you may scale back the variety of attainable choices on the market, and totally different code paths that you must fear about. And we expect that AI has an enormous place in bettering security and decreasing the danger of utilizing this sort of tooling.

DR: How extensively adopted is the usage of characteristic flags at this level?

KN: We predict that particularly amongst mid market to giant tech corporations, it’s most likely a majority of corporations which are presently utilizing characteristic flagging in some capability. You do discover a good portion of corporations constructing their very own. Usually engineers will take it into their very own palms and construct a system. However typically, whenever you develop to some degree of complexity, you rapidly notice there’s so much concerned in making the system each scalable and likewise work in quite a lot of totally different use circumstances. And there are many issues that find yourself developing because of this. So we expect it’s a great portion of corporations, however they could not all be utilizing third-party characteristic flagging instruments. Some corporations even undergo the entire lifecycle, they begin off with a characteristic flagging software, they rip it out, then they spend important effort constructing comparable tooling to what Google, Uber, and Fb have, these dynamic configuration instruments.


You may additionally like…

Classes realized from CrowdStrike outages on releasing software program updates

Q&A on the Rust Basis’s new Security-Essential Rust Consortium

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles