Software complexity

Software is among mankind’s most complex creations. How do we measure software complexity and does it even make sense to do so? When I think of the most complex structures we have created, the arts (especially music), and … software come to mind. From an economic perspective, we consider software complexity to be a bad thing because the more complex the software, the more time and expense it takes to build and maintain. But sometimes it needs to be complex in order to solve complex problems. There can also be an aesthetic beauty to software that only developers seem equipped to appreciate. I recall once looking at code that controlled a spacecraft and being blown away not only by its complexity but also how by how very well written it was.

Music is different in that how complicated it is seems to affect people in different ways. The high level of complexity in a Mozart symphony seems to contribute to its immortality. On the other hand, simple popular music tunes have a wide fanbase.

Complexity of Music

In his 1933 book Aesthetic Measure¹, preeminent American mathematician George David Birkhoff proposed a mathematical theory of aesthetics. In the course of writing the book, he spent a year studying art, music, and poetry of various cultures around the world. He developed a formula to measure the aesthetic quality of an art object (e.g., a work of music) as being the ratio between its order and complexity. Since that time, researchers have built upon his work to come up with other ways of analyzing complexity of music. Mandelbrot’s protégé Richard Voss together with John Clark applied fractals to mathematical analysis of music². April Pease and her colleagues extended this work by searching for the presence of crucial musical events based on an analysis of volume and using this as a measure of complexity³. I find it interesting that in music complexity, the performance is measured, not the static sheet music (or electronic equivalent). Music played by a computer reading sheet music has been found be less complex than a performance by accomplished musicians!

The software profession has struggled with how to measure software complexity for decades. Thomas McCabe came up with the idea of using Cyclomatic Complexity to measure the number of logical paths through code⁴. But this has been shown to not be any better than just counting source lines of code (SLOC). Two methods currently in use are a set of six metrics proposed by Shyam R. Chidamber and C.F. Kemerer specifically designed for object-oriented code⁵, and a different set of six metrics proposed by Maurice H. Halstead⁶.

Comparison of Software Complexity Metrics

Chidamber and Kemerer Metrics Note that these metrics are per class so you would sum them up for all classes in the program	Halstead Metrics
WMC – weighted methods per class is the sum the complexities of each class method but they used a complexity of 1 for each method so this is really just the number of methods in a class	Program Vocabulary n = n₁ + n₂ n₁ – number of distinct operators n₂ – number of distinct operands
CBO – coupling between object classes is the number of other classes which are coupled (using or being used)	Program Length N = N₁ + N₂ N₁ – total number of operators N₂ – total number of operands
RFC – response for a class is the number of methods called by each class method summed together	Calculated Estimated Program Length = n₁ log₂ n₁ + n₂ log₂ n₂
NOC – number of children is the sum of all classes that inherit this class or a descended of it	Volume V = N x log₂ n
DIT – depth of inheritance tree is the maximum depth of the inheritance tree for this class	Difficulty D = (n₁ / 2) x (N₂ / n₂)
LCOM– lack of cohesion of methods measures the intersection of the attributes used in common by the class methods	Effort E = D x V

Measuring Software Complexity

So why should we care about measuring software complexity? Here are some claims, in many instances being made by companies that are selling complexity measurement products or consultants that will help you figure out how to use them.

Better estimates on software maintenance effort

I suppose if you had enough empirical data to somehow relate complexity to maintenance cost, this might be useful. There certainly have been a lot of studies on this. The problem is that there are other significant factors that affect software maintenance costs like

the number and type of new or changed user requirements concerning functional enhancements
the amount of adaptation that needs to be done to support a changing environment (e.g., Database, Operating System)
the amount of preventative maintenance that needs to be done to improve reliability or prevent future problems

Monitoring complexity so as to keep it lower, saving cost and reducing risk

So, do we have the Software Development Manager tell developers that their LCOM or DIT is too high and they need to fix it? Really? I suppose this could be an indicator that could be used to focus code reviews (i.e., spend more time reviewing code that has higher complexity) but I don’t see that there would be much value in doing this especially if your team is already doing effective code reviews with good coverage.

Using complexity as criteria for deciding to refactor or rewrite software

The suggestion here is that a software development manager would monitor the complexity across the codebase and when a module gets above a certain threshold, a decision to refactor or rewrite would be considered. My experience is that the development team already knows which sections of code are the best candidates for refactoring based on the effort required to maintain them. I’d trust that measure much more than a complexity metric. Another consideration is how often complex code needs to be touched. I’ve worked in organizations where we had a large, complex legacy codebase that we didn’t touch, just wrap it with a façade or adaptor.

My view on software complexity is that there is little if any value in measuring it outside of academia. There is a hospital sketch in Monty Python’s Meaning of Life movie where doctors call for the operating room to be filled with the most expensive equipment in order to impress the administrators should they drop in for a visit. John Kleese specifically asks for staff to bring in “the machine that goes ‘Bing’.” A software complexity dashboard would seem to have equivalent utility.

¹ George D. Birkhoff, Aesthetic Measure, Harvard University Press, 1933.
² R.V. Voss, J. Clarke, 1/f Noise in Music and Speech, Nature, 258 (1975).
³ April Pease, Korosh Mahmoodi, Bruce J. West,Complexity Measures of Music, Chaos, Solitons and Fractals 108 (2018) 82–86.
⁴ McCabe (December 1976). “A Complexity Measure”. IEEE Transactions on Software Engineering (4): 308–320.
⁵ Chidamber, S.R.; Kemerer, C.F. IEEE Transactions on Software Engineering Volume 20, Issue 6, Jun 1994 Page(s):476 – 493.
⁶ Halstead, Maurice H. (1977). Elements of Software Science. Amsterdam: Elsevier North-Holland, Inc. ISBN 0-444-00205-7.

Author: Steve Kowalski

Chief Technology Officer (CTO) - SaaS, Cloud, Agile View all posts by Steve Kowalski