Are you achieving the aims of OOP?

Posted on 1st August 2019 by Tony Marston

Amended on 12th May 2024

Introduction

This question will probably confuse a lot of novice programmers as they do not realise that Object Oriented Programming (OOP) had any aims in the first place. It's just a style of programming, right? Surely the only aim to programming is the writing of programs, the creation of software, right? While those answers are partially correct they miss an important point - the production of software on its own is not a measure of success, it is how effective, particularly cost effective it is viewed as by the end customer, the user, the one who pays the bills. Different programming languages are created with different features and syntax in an attempt to offer something which is better than the alternatives. As explained in What is OOP? Object Oriented languages are supposed to better because of the following:

Object Oriented Programming is programming which is oriented around objects, thus taking advantage of Encapsulation, Inheritance and Polymorphism to increase code reuse and decrease code maintenance.

This sentiment is echoed in Designing Reusable Classes which was published in 1988 by by Ralph E. Johnson & Brian Foote where they say:

Since a major motivation for object-oriented programming is software reuse, this paper describes how classes are developed so that they will be reusable.

In this paper they describe how looking for abstractions can lead to a technique for building classes which they refer to as programming-by-difference, which can be summarised as follows:

Abstraction is the act of separating the abstract from the concrete, the similar from the different. An abstraction is usually discovered by generalizing from a number of concrete examples. This then permits a style of programming called programming-by-difference in which the similar protocols can be moved to an abstract class and the differences can be isolated in concrete subclasses.

Even though their paper was written with the Smalltalk language in mind (which is strictly typed) which means that some parts of it are irrelevant when it comes to PHP (which is dynamically typed), there are other parts which are still relevant, as explained in The meaning of "abstraction".

The more reusable code you have at your disposal then the less code you have to write, and the less code you have to write to get the job done then the less time it will take to get the job done.

Why is reusable code so important? The more reusable code you have at your disposal then the less code you have to write, and the less code you have to write to get the job done then the less time it will take to get the job done. The less time it takes to get the job done then the more productive you will be, and achieving high productivity is the best way to win friends and influence people. In the RADICORE framework, where each application subsystem is treated as a plug-in, each user transaction (task) will be comprised of the four different elements which are shown in Figure 1 below:

Figure 1 - A combination of the 3-Tier Architecture plus MVC

There is also a version of this structure which is more detailed.

So, how many of the above four components do you NOT have to write?

You do not need to write any of the 45 Controllers as they are built into the framework.
You do not need to write the single HTML View as that is built into the framework.
You do not need to write any of the Models as these are generated by the framework. 100% of the standard code is inherited from an abstract class while custom code can be inserted into any concrete subclass using any of the predefined "hook" methods.
You do not need to write any of the Data Access Objects as they are built into the framework.

The surprising thing is that I achieved this level of reusability by not following those ideas called "best practices". This is because when I started building my PHP framework in 2002 I did not know they existed. When I was made aware of them some years later I could see straight away that by amending my code to follow them I would end up with much more code and less reusability, so I decided to ignore them altogether. While I have identified a small selection of rules which I regard as being universally applicable, I have also identified some rules which I do not follow as I consider them to be an obstacle to reusability and therefore totally inappropriate. The remainder of this article documents the steps I took when building my framework in order to maximise the amount of reusable code which it produced.

Universal rules

These are the only rules/guidelines I have encountered which can be said to be universally applicable. See if you agree.

The primary "rule" that I have followed in all the languages that I have used is based on the following statement from Abelson and Sussman in their book Structure and Interpretation of Computer Programs which was first published in 1985:
Programs must be written for people to read, and only incidentally for machines to execute.
Martin Fowler, the author of Patterns of Enterprise Application Architecture (PoEAA) put it another way:
Any fool can write code that a computer can understand. Good programmers write code that humans can understand.
Here is another variation of that saying:
Any idiot can write code that only a genius can understand. A true genius can write code that any idiot can understand.

There are some people who seem to think that PHP is too verbose and seek to "improve" the situation by introducing a more compact syntax. In my humble opinion this would be a backward step. There is a word for the act of replacing proper human-readable words (plain English) with symbols, and that word is obfuscation.

This topic is discussed further in the following:
Try to follow is the KISS principle. This means that if you have a choice between a simple solution and a complex solution, then you should ALWAYS choose the simplest one as it will pay dividends in the long run. Another way of expressing this principle is as follows:
Try to achieve complex tasks using simple code, not simple tasks using complex code.

This is in complete contrast to most of today's programmers who seem to prefer the KICK principle.
Try to follow the DRY principle. This means that you should avoid having the same piece of logic duplicated in multiple places as any change to that logic, such as fixing a bug or adding an enhancement, would have to be duplicated in all of those places. If you missed out just one of those places then your application could behave in an unexpected manner. It would be better to put that logic into a reusable library so that it can be defined once and reused many times.
If it ain't broke don't fix it is a principle which originated in the world of mechanical/electrical engineering, but is just as applicable in software engineering. It means that once you have found a simple solution that works then don't try to fiddle with it to make it "better", "purer" or the mythical "perfect" as your efforts could have the opposite effect. The practice of constantly fiddling with working software just to make it "better" is attacked in When is Enough, Enough?
Don't violate the YAGNI (You Aren't Gonna Need It) principle. Don't insert a solution to a problem which does not exist in your code. Just because a new feature is added to the language does not mean that you should immediately change your code to implement this feature. Unless you actually have the problem for which the feature is a solution then adding this solution will be a complete waste of time and achieves nothing except add to code bloat. This idea is discussed in the following:
- Oooh Shiny! Magpies don't know what's just enough! by Matt Williams
- Does Your Team Have STDs? by Jared Richardson
Prevention is better than Cure is a principle which states that if you do something which causes a problem then instead of trying to deal with that problem a better solution would be not to cause that problem in the first place. A typical example is where the database is designed using one methodology while the software is designed using a different and totally incompatible methodology. Because they are incompatible they produce a problem known as Object-Relational Impedence Mismatch which is often solved by introducing an extra piece of software known as an Object-Relational Mapper (ORM). I avoid this problem by *NOT* designing my software using an incompatible methodology. Instead I design my database first, then build a separate class for each table which has knowledge of that table's structure. Each table class inherits common methods which correspond directly with the ubiquitous CRUD operations. By eliminating the mismatch I have eliminated the need for an ORM.
The idea of creating a separate class for each database table is actually recommended in the principle of Information Expert (GRASP) which states the following:

Assign responsibility to the class that has the information needed to fulfill it.

I interpret "responsibility" to mean code and "information" to mean data, so because each database table is responsible for a different set of data it follows that I should have a separate class to control access to that data, and that class should also contain all the methods which can act upon that data.

It has been my experience that the structure of the database takes priority over the structure of your code, and it is easier to structure your code around the database design than to structure your database around the software design. Eric S. Raymond, the author of The Cathedral and the Bazaar put it like this:

Smart data structures and dumb code works a lot better than the other way around.
This is the opposite view to that of Robert C. Martin (Uncle Bob) who, in his article NO DB wrote the following:
The database is just a detail that you don't need to figure out right away.
I prefer to ignore Uncle Bob and follow the principles identified in Jackson Structured Programming where it says
start with the data structures of the files that a program must read as input and produce as output, and then produce a program design based on those data structures, so that the program control structure handles those data structures in a natural and intuitive way.
Avoid putting all of your code into a single monolithic procedure as the different pathways through that code can be difficult to spot. DO NOT create a single object called "application" with a single method called "run". Instead take a large procedure which does lots of things into a number of smaller procedures each of which does a single thing. This is known as Modular Programming and makes it possible to produce a structure chart which is a great documentation aid. As well as being able to describe the structure of an individual program it should also be possible to produce a structure chart for the entire application, as shown in this diagram of the RADICORE framework, for example.
Try to achieve the best level of cohesion. High cohesion is better than Low cohesion. High cohesion is achieved when related functions are grouped together in the same module. Low cohesion is achieved when related functions are spread across different modules, or when a module contains functions which are not related. High cohesion can be achieved by producing modules which each have a single responsibility or concern. This can be done, for example, by implementing the 3 Tier Architecture or Model-View-Controller design pattern.
Try to achieve the best level of coupling. Loose coupling is better than Tight coupling. Loose coupling will help avoid the ripple effect where a change to one module leads to corresponding changes in other modules. This can be achieved by having high levels of polymorphism using stable interfaces which are resistant to change, and this in turn promotes the use of dependency injection.

All the other "rules" or "best practices" which I have encountered are mainly based on one of the above, but merely expressed in a greater level of detail. You try and get a bunch of programmers to define what "readable" code means and all you will do is start a never-ending debate on when to use uppercase or lowercase, when to use camel case, snake case or even Studly caps. The only common description of "bad code" which is offered by most programmers is code not written by me, which means that the topic is purely subjective and there will never be universal agreement.

Inappropriate rules

It was not until several years after I had starting publishing what I had achieved that some so-called "experts" in the field of OOP informed me that everything I was doing was wrong, and because of that my work was totally useless. When they said "wrong" what they actually meant was "different from what they had been taught" which is not the same thing. What they had been taught, and which is still being taught today, is that in order to be a "proper" OO programmer you must follow the following sets of rules:

When I examined these principles I took an instant dislike to them for the simple reason that by changing my code to follow them I would be destroying vast amounts of reusable code, so I decided to ignore them. I have documented criticisms of some of these rules in the following:

My critics fail to understand that sometimes I don't follow a particular practice/principle/rule simply because it is not appropriate for the type of applications which I develop. This could either make that practice completely redundant, and thus a violation of YAGNI, or produce a result which is not as optimum as it could be. If I can find a way that is better than your "best" then surely I should be applauded and not admonished.

Note that the sole measurement for judging what is "best" should be "that which produces the best results". This means that the software should be cost-effective so that where several pieces of software have the same effect the one with the lowest cost should always be regarded as being better. Note that "cheaper but less effective" does not qualify as software which is not effective should be regarded as being unacceptable no matter how low the price. With software development it is the cost of the developers time which is a crucial factor, and the best way to reduce the amount of time taken for a developer to write code is to write less code, which in turn can be achieved by utilising as much pre-written and reusable code as possible. Since the stated aim of OOP is to increase code reuse and decrease code maintenance then any practice which encourages the production of reusable code should be regarded as being better than any practice which does not. I encourage you to read Designing Reusable Classes which was published in 1988 by Ralph E. Johnson & Brian Foote for ideas on how this can be achieved.

Simply following a series of rules or common/standard practices is not enough on its own. Following rules blindly in a dogmatic and pedantic fashion and assuming that the results you achieve will be the same as those achieved by others is the path to becoming nothing more than a Cargo Cult Programmer. You have to analyse a problem before you can design a solution, then you need to decide which practices to apply and how to apply them in order to achieve the best results. By "which practices" I mean that some rules or practices may be inappropriate for various reasons. Trying to build a modern web-based database application using practices which were created by people who had little or no experience with such applications is unlikely to set you on the path to success. Most of the rules, practices and principles which I have encountered were written in the 1980s by academics using the Smalltalk language, or something similar, but how many of these people used these languages to write enterprise applications with hundreds of tables and thousands of screens? Also, practices designed for programs using bit-mapped displays are not relevant for modern programs which use HTML forms.

Below are some the rules, principles and practices which I consider to be inappropriate, so I ignore them:

Rules that were written for creating software which controls a single entity are totally irrelevant for software which does nothing but maintain information on hundreds of different entities in a database. A physical entity may have many operations to control it, but in a database application all that information is held in objects known as database tables. While an individual entity in the real world may have operations such as "turn on", "turn off", "start" and "stop" these are irrelevant in a database application. It does not matter how many tables you have or what information is held in each table, the only operations which can be performed on that information are Create, Read, Update and Delete (CRUD). It should be obvious that once you have written code to deal with one database table a great deal of that code can be applied to any other database table, so there is much more opportunity for writing code that can be written once and reused many times. The following rules will not produce the best results in a database application:
- Using the IS-A rule to create class hierarchies. All the objects in a database are of the same type (they are all tables) so share common operations but have different structures. The primary method in OOP of sharing code is to use inheritance, but this can cause problems if it is overused. This mistake can be avoided by only ever inheriting from an abstract class, as explained in Designing Reusable Classes.
- Using the HAS-A rule to create object associations, where an entity is composed of or associated with other entities. If you are only dealing with a single entity which has a fixed set of associations there is little scope for code reuse, but in a database application with hundreds of tables and hundreds of relationships there are more opportunities for spotting similar patterns of behaviour and creating reusable code for each pattern.
Rules that were written by academics who have no experience of writing web-based database applications for commercial organisations may be totally inappropriate and produce less-than-optimal results. Unless you know how databases work you will have no idea of how to write code that works with a database. Here are some examples:
- Designing your code structure according to OO theory and ignoring the way that databases are designed, as discussed in OO Design is incompatible with Database Design
- Using getters (accessors) and setters (mutators) to deal with data items one at a time, thus contributing to tight coupling, when the HTML and SQL sources provide data in arrays. This is discussed further in Getters and Setters are EVIL.
- According to OO theory database relationships are known as associations and must be dealt with by the controlling object calling special methods on each of the controlled objects. This is not the RADICORE way. I prefer to use standard methods which are supplied in the abstract table class instead of adding specialised methods to each concrete table (Model) class. All the code for dealing with related tables is therefore contained within specialised Controllers which can access related tables. This is explained in Object Associations are EVIL
- Databases do not have collections of finder methods as all that is necessary in an SQL query is a WHERE string and/or a HAVING string. Building objects with custom methods just to build these strings seems like overkill to me as it is easier to build them directly than to work out which methods to call to get the object to do it for you.
- Rules designed for software which responds to individual keystrokes and mouse events are not relevant to software which only responds to the pressing of the SUBMIT button by sending in groups of data items.
Rules devised for statically typed languages may be totally out of place in dynamically typed languages (refer to Object Interfaces are EVIL for an example).
Rules written by people who don't understand what they are doing are irrelevant. For example:
- The Liskov Substitution Principle is only relevant when you inherit from a concrete class to create another concrete class. This does not apply to my framework as I only ever inherit from an abstract class, and I never change any method signatures.
- The Interface Segregation Principle is only relevant when you use object interfaces, which I do not. They were invented to provide polymorphism without inheritance in strictly typed languages. PHP is dynamically typed and does not need anything to provide polymorphism without inheritance.
- program to the interface, not the implementation is meaningless as you cannot simply call an interface, you must call a method on an object which actually implements that interface. I have yet to see a code sample which proves that this idea has merit, and until I do I will dismiss it as bogus. If this is supposed to mean calling a method on an unknown object where the identity of that object is not provided until runtime, then as a description of how to use polymorphism is is pretty pathetic.
- Using the Dependency Inversion Principle to justify separating the line of code which instantiates an object with the line of code which calls one of its methods. This is a ridiculous overkill as discussed in Confusion about Dependency Inversion and Dependency Injection.
- Over-using dependency injection by injecting entities into entities when it was only supposed to be used for injecting entities into services. Building into your code the ability to swap to an alternative object, but not actually performing any swaps because there aren't any alternative objects, is a violation of YAGNI.
Rules that are described so badly that their meaning is vague and open to several interpretations. If a rule is not described using precise and unambiguous terminology this can lead to mis-interpretations which in turn can lead implementations which do not produce the desired effect.
- software entities should be open for extension, but closed for modification implies that, once deployed, you should not modify an object but extend it (using "extends" to create a subclass?), which sounds like it creates more problems than it solves. In the life of my framework I have performed numerous refactorings, and if I was forced to put each update in a separate subclass I would also have to change all references of the original class to the updated subclass. Refer to Open/Closed Principle for more of my thoughts on this matter.
- depend upon abstractions and not concretions is vague because there are so many different interpretations of the word abstraction. It took me years to realise that what this meant was to define methods in an abstract class which could then be inherited by numerous subclasses, thus providing polymorphism, which then gives you the ability to swap from one subclass to another at run time. But what if I don't have multiple subclasses? Refer to Dependency Inversion Principle for more of my thoughts on this matter.
- The Single Responsibility Principle stated that each software module should have one and only one reason to change, but "reason to change" was found to be so inadequate and confusing that Uncle Bob had to produce a follow-up article to explain that what he meant was the separation of GUI logic, business logic and database logic. This is, in fact, the same thing as the 3-Tier Architecture which is less confusing to implement as it has a more precise and less ambiguous definition.
  Although this principle was supposed to be based on the idea of cohesion the words reason for change were so confusing that a lot of developers tried to apply it based on a module's size instead of its contents. This meant that instead of a single class containing all the functions related to an entity those functions were scattered across multiple small classes. Instead of creating code with high cohesion they instead created the opposite, which is low cohesion.
Rules which have the same meaning but with different names, which leads some immature thinkers to believe that they are different rules. Examples are:
- The Single Responsibility Principle (SRP) and Separation of Concerns (SoC) which some claim to be different principles based on the following arguments:
- The Single Responsibility Principle (SRP) and the 3-Tier Architecture which both advocate the separation of GUI/Presentation logic, Business log and Database logic. When applied correctly they both produce the same result. Some immature thinkers believe that SRP is based upon size (a class should have no more than X methods and each method should have no more than Y lines of code), but this is never mentioned anywhere by Robert C. Martin. This idea proves that these people have developed the ability to count but not the ability to think, and I doubt very much that they can count above ten without taking their shoes and socks off.
Rules which are described as "best practices" imply there there is "nothing better", but to satisfy the aims of OOP the term "best" must be equated to "most reusability". For example:
- Saying that each Model must have its own dedicated Controller to handle all the tasks (events) which can be performed on that Model. My approach is the opposite - I have a series of pre-built and reusable Controllers which perform a single task on an unknown table, as documented in Transaction Patterns for Web Applications. This means that, using the power of polymorphism, I can link any Controller with any Model.
- Saying that each Model must be hand-crafted to contain a separate method for all the actions that can be performed on it. I recognised years ago that every task follows the same pattern by performing one or more operations on one or more database tables, and the only operations which can be performed on a database table are Create, Read, Update and Delete (CRUD). These operations can be provided by a standard set of methods which are defined in an abstract table class and inherited by every concrete table class (Model), so it is up to the Controller to call the relevant method using names which are accessible through polymorphism.
- Saying each task (event) must have its own method. As each task has its own identity this would require thousands of unique method names, and this would totally eliminate the benefits of polymorphism. In my implementation of MVC each task is a combination of a Controller and a Model where each database table has its own Model which can be accessed by as many different Controllers as is necessary. Each Model supports the same set of common table methods which allows polymorphism, and each Controller calls a different subset of these methods on what Model(s) it is given. Each task has a URL which points to a particular component script in the file system, and it is this script which identifies which Model(s), View and Controller are required to carry out that task. Each task has its own entry in the MNU_TASK table in the MENU database, and this allows for the easy construction of hierarchies of menu screens and the specification of Role Based Access Control
It should be obvious to every OO programmer that shared method names offer polymorphism while unique method names do not. Polymorphism provides the opportunity for more reusable code as it enables Dependency Injection.
Practices which were designed to solve a particular problem should not be followed unless you actually have that particular problem. Adding code that you don't actually need is a violation of YAGNI. Here are some examples:
- Front Controllers are a requirement of compiled languages where all application components are merged into a single executable program file. In this case you cannot jump directly to a component (sub-program) within that program file as that file has only one entry point. This single entry point must then contain code to identify which action is required so that it can call the relevant sub-program to perform that action. This redirection is handled in what is known as a router or dispatcher. Modern web languages such as PHP running under a web server such as Apache do not require a single executable program file as each component within an application can be provided with its own independent script in the file system, and each script can be accessed directly in the web server via the URL provided by the web browser. This means that all the functionality of a front controller is handled automatically by the web server and does not need to de duplicated in the application code. For details please refer to A minimalist approach to OOP with PHP - Front Controllers.
- Autoloaders were introduced into the language to get around the problem of manually inserting a long list of include statements at the beginning of each script. The simple fact is that I never have to write long lists of include statements as most of them are automatically handled by framework code. For details please refer to A minimalist approach to OOP with PHP - Autoloaders.
- Namespaces were added to the language with the specific objective of avoiding name collisions between code that you are in the process of writing and unknown code that that has already been written by a third party. This should exclude the PHP language itself as it has nothing with which it can possibly collide. Application code is written using the PHP language, so any naming collisions have to be fixed before that code will be allowed to run. The only genuine place for namespaces is within third party libraries which are developed independently of any application code and which may be added retrospectively into an existing code base using the Composer dependency management utility. For details please refer to A minimalist approach to OOP with PHP - Namespaces.
Rules newly created by programmers whose understanding of OOP is sadly lacking. Among such examples are:
- Inheritance Is a Procedural Technique for Code Reuse is complete rubbish (as explained in Composition is a Procedural Technique for Code Reuse) as procedural languages do not have inheritance. I should know as I used COBOL, the most famous and widely used procedural language of all time, for 16 years, and I never found any hint of anything which could be called "inheritance". All those people who claim that inheritance is bad simply do not know how to use it properly. I advise these people to read Designing Reusable Classes which was published in 1988 by Ralph E. Johnson & Brian Foote. In it they describe the technique of placing protocols which are common to a group of different classes into an abstract class so that they can be shared through inheritance instead of being duplicated. The use of an abstract class then enables the use of the Template Method Pattern which is a fundamental technique for code reuse in frameworks as it easily implements the Hollywood Principle (Don't call us, we'll call you).
- Objects should be constructed in one go which is an extension of the rule that it should not be possible for an object to exist in an inconsistent state where the word "state" is mistakenly taken to mean the data within an object when it actually means the condition of an object. The ONLY absolute rule regarding constructors is that after being executed the constructor should leave the object in a condition which will allow any of its public methods to be called. My full response to this schoolboy mistake can be found in Re: Objects should be constructed in one go.
- Constructors Must Be Code-Free is nothing more than a personal opinion of a person who has had problems with the code he placed in a constructor. That is down to poor coding on his part. In my own code I have a legitimate reason for such code, and this code does not cause problems if a class is extended. As for object composition - I never use it as I have learned how to use inheritance properly. I have a separate Model class for each database table where a standard set of methods is inherited from an abstract table class and where the constructor is used to populate the common table properties with the metadata which is appropriate for that table.
Novice programmers often confuse "common practice" with "best practice". Just because lots of people do something in a particular way does not make it the "best" way. If lots of people do something which is wrong does not make it less wrong, just as if lots of people commit a criminal act does not stop it from being a criminal act. I have been told many times that the practices which I follow are wrong simply because that they are not the same practices that everyone else follows, and that I should follow the same rules in order to be consistent. This to me is the wrong attitude. Following a practice which is bad just "to be consistent" would make my code nothing but "consistently bad". If it comes to light that there are several different ways of achieving something then an inquisitive programmer should examine these different ways and decide himself which one is best by seeing which one produces the best results or the fewest problems. For example:
- Every programmer is taught to avoid inheritance because of the Composite Reuse Principle (CRP) which is commonly known as favour composition over inheritance. When I first heard this I was confused because there was no explanation of what "composition" meant and why "inheritance" was bad. I had been using inheritance for many years and I had never encountered any problems, just vast amounts of reusability. It took me a long time to discover precisely what the so-called "problems" with inheritance actually were simply because everybody was simply echoing the rule without explaining why it existed. When I eventually discovered the whys and wherefores I was amazed to discover that all these people were making the same mistake by not understanding how to use inheritance properly, which is to only inherit from an abstract class. The idea behind OOP is the more reusable code you have at your disposal the less code you have to write. With inheritance I don't have to write any code beyond the "extends" keyword, but with composition I have to write code in my class to call methods in a different class. This is explained in Inheritance is NOT evil.
- Constructing and displaying an HTML form in one script but posting to another script. I saw several examples of this practice and took an instant dislike to it as I was used to creating a single program to send the form to the client VDU/CRT as well as receiving the form's input. This was simple and logical as it involved using the same data buffer to both send and receive. The act of using separate programs and therefore separate data buffers to send and receive could cause problems if there were differences between the two buffers.
- Using hyperlinks to activate different screens. On several sample scripts I encountered where a LIST screen showed multiple rows of data going across the page it was common practice to include one or more hyperlinks on each row so that the user could activate another screen to either show more details or perform a different action on that row. I chose not to follow this practice for the following reasons:
  - This uses the GET method which requires that the primary key of that database row be included in the URL, otherwise the activated screen will not know which row to work on. Including a row's primary key in the URL represents a security risk as a naughty user could easily change and of the values in the URL and operate on a different row.
  - Each hyperlink could only pass the identity of a single row to the activated screen, so if the user wanted to perform the same action on a different row he would have to exit back to the LIST and press the hyperlink on that different row.
  - The number of hyperlinks you could add to each row of data could easily overflow the screen size, and you could end up with more hyperlinks than columns of data.
  - You would have to write code to add the same group of hyperlinks to each row but with different primary keys.
  I decided instead on a completely different approach - instead of hyperlinks on each row I would use buttons in a separate navigation bar. This has the following advantage:
  - It uses the POST method which sends the request back to the current form without changing the URL and therefore without exposing any primary keys to the user, thus avoiding any possibility that they can be changed.
  - When one of these buttons is pressed the form will POST to the script a list of row numbers which have been selected. This will be converted into a string containing the primary keys for each of the selected rows, and this string will be added to the $_SESSION data in an new area which is set aside for the new screen. When that new screen is activated it would access this area and use this string of primary keys as the $where argument in the call to the getData() method.
  - None of these buttons takes up any room on any row in the screen.
  - I added a single checkbox at the front of each row which could allow the user to select any number of rows before pressing a button.
  - In the newly activated screen there would be a scrolling area which would allow the user to scroll back and forth through the selected rows without having to exit back to the LIST screen beforehand.
  - The developer does not need to write any code to display these navigation buttons, nor deal with the action required when a button is pressed as it is all handled by standard code within the framework.
  This may seem to be a complicated method which requires a great deal of clever code, but it is nothing more than a series of different steps each of which I accomplished using simple code. I only had to write this code once and build into into the framework, and since then it has been used in thousands of different screens, so I saw all that effort as a one-time investment which has repaid itself many times over.
Avoid practices which promote low cohesion. High cohesion is achieved when related functions are grouped together in the same module. Low cohesion is achieved when related functions are spread across different modules, or when a module contains functions which are not related. A typical cause of low cohesion, which is also a violation of encapsulation, is when gullible programmers mis-apply the Single Responsibility Principle and end up with too much separation in which related functions which should exist in a single class are spread across multiple classes. Don't forget that the more method calls you have the more coupling you have, and too much coupling makes it difficult to understand the program flow because of all those jumps from one method to another. I personally find it easier to read 10 lines of code in a single block than to follow 10 method calls each executing a single line of code.
Avoid practices which promote tight coupling. Coupling occurs when one module calls another, and tight coupling means that changes to one of those modules forces a ripple effect of changes to the other. The more coupling you have the more modules you must change in order to deal with this ripple effect.
The most common cause of tight coupling which I see promoted on a regular basis is the practice of having a separate property for each column of data in a table class. This then requires a separate "getter" and "setter" for each column, which means that changing the number of columns in a table requires changes to that table's class as well as all the places which use that class. As all data, either from an HTML form or the database, is originally presented as an array I find it easier to keep that data in that array without the need to split it into its component parts and then have to deal with each part separately. This means that I can change the contents of the array at will without having to amend any method signatures

PHP was created to make it easy to create dynamic web applications, those which have HTML at the front end and an SQL database at the back end. I was involved in writing enterprise applications (database applications for commercial organisations) for 20 years before I switched to using PHP, so I knew how such applications worked. I had even created frameworks in two of those languages. All I had to do was to convert my latest framework to use PHP and the OO features which it offered in order to create as much reusable software as possible. The structure of my RADICORE framework can be pictured in Figure 1 above.

There is also a more detailed version available. This shows that the RADICORE framework uses a combination of the 3 Tier Architecture, with its separate Presentation layer, Business layer and Data Access layer, and the Model-View-Controller (MVC) design pattern. The following amounts of reusability are achieved:

Controllers - these are pre-built and supplied by the framework. There is a separate one for each Transaction Pattern. Those developers who are still hand-crafting a separate Controller for each Model have failed to recognise the repeating patterns which are obvious to the trained eye, and repeating patterns are the starting point for reusable code.
Views - these are pre-built and supplied by the framework. There is a separate one for HTML, CSV and PDF output. Those developers who are still hand-crafting a separate HTML document for each task instead of using a templating system need to improve on their pattern recognition skills.
Models - these are generated manually from the Data Dictionary after a table's details have been imported. They inherit all the basic code from the abstract table class leaving the developer with nothing to do but populate "hook" methods with custom code as and when necessary. Those developers who are still hand-crafting each Model class from scratch are making a rod for their own backs.
Data Access Objects - these are pre-built and supplied by the framework. There is a separate class for each supported DBMS (MySQL, PostgreSQL, Oracle and SQL Server). Those developers who are still creating a separate DAO for each table have failed to notice that all SQL queries, regardless of the table upon which they operate, follow the same pattern and can be constructed to operate on any table.

Note also that any Controller can be used with any Model (and conversely any Model can be used with any Controller) because every method call made by a Controller on a Model is defined as a Template Method in the abstract class which is inherited by every Model. This means that if I have 45 Controllers and 400 Models this produces 45 x 400 = 18,000 (yes, EIGHTEEN THOUSAND) opportunities for polymorphism and therefore Dependency Injection.

I was able to produce a single View module which can produce the HTML output for any transaction as a result of my choice to use XSL Transformations and a collection of reusable XSL Stylesheets. This is coupled with the fact that I can extract all the data from a Model with a single call to $object->getFieldArray() instead of being forced to use a separate getter for each column, as discussed in Getters and Setter are EVIL.

What is OOP?

I worked with several non-OO languages for over 20 years writing enterprise applications before I switched to using PHP in 2002 with its OO capabilities, and the first thing I needed to do was to find out what OOP actually meant and why it was supposed to be better than previous programming paradigms. The initial definition that I found at that time was roughly as follows:

Object Oriented Programming is programming which is oriented around objects, thus taking advantage of Encapsulation, Inheritance and Polymorphism to increase code reuse and decrease code maintenance.

These three characteristics can be described as follows:

Encapsulation	The act of placing data and the operations that perform on that data in the same class. The class then becomes the 'capsule' or container for the data and operations. This binds together the data and functions that manipulate the data. More details can be found in What is OOP? - encapsulation
Inheritance	The reuse of base classes (superclasses) to form derived classes (subclasses). Methods and properties defined in the superclass are automatically shared by any subclass. A subclass may override any of the methods in the superclass, or may introduce new methods of its own. More details can be found in What is OOP? - inheritance
Polymorphism	Same interface, different implementation. The ability to substitute one class for another. By the word "interface" I do not mean object interface but method signature. This means that different classes may contain the same method signature, but the result which is returned by calling that method on a different object will be different as the code behind that method (the implementation) is different in each object. More details can be found in What is OOP? - polymorphism

Some programmers insist that there is a fourth component called abstraction, but I disagree. This is a mental process which cannot be performed in code, but which helps you design code which can be reused.

Abstraction

The process of separating the abstract from the concrete, the general from the specific, by examining a group of objects looking for both similarities and differences. The similarities can be defined in an abstract superclass so that they can be shared by all members of that group while the differences can be defined in separate concrete subclasses.

More details can be found in What is OOP? - abstraction and The meaning of "abstraction".

Some people seem to think that encapsulation and abstraction mean the same thing - information hiding - which is a big mistake. Encapsulation is about placing data and the operations which act upon that data in a "capsule" called a "class" while abstraction is about refactoring classes in order to share common code by inheriting it from an abstract class.

I noticed in the PHP manual that these capabilities were added to the language without removing the ability to write procedural code, so it was possible to have a mixture of procedural and OO code in the same program, thus leaving it up to the individual programmer to decide which style was best in a particular set of circumstances. This lead me to the conclusion, as documented in What is the difference between Procedural and OO programming? that:

Object Oriented programming is exactly the same as Procedural programming except for the addition of encapsulation, inheritance and polymorphism. They are both designed around the idea of writing imperative statements which are executed in a linear fashion. The commands are the same, it is only the way they are packaged which is different. While both allow the developer to write modular instead of monolithic programs, OOP provides the opportunity to write better modules.

It was quite clear to me that the objective of using OOP was to take advantage of the additional features in such a way as to increase the amount of reusable or sharable code within your application and thus reduce the amount of code which you have to write yourself. An application consists of a number of different components (my ERP application currently has over 3,000) so it would not be unusual to find identical or similar pieces of code being used by more than one component.

What is the benefit of reusable code?

Why would increasing the amount of reusable/sharable code be of any benefit? If you have a piece of logic which is common to several components the novice's method of implementing this would be to duplicate that logic, using the copy-and-paste method, in each component. This produces multiple copies of the same piece of logic in multiple places, so if a bug is ever found in that logic, or it needs to be updated, you have to find every copy of that logic in order to update it. All you have to do is miss one copy and the result could be at best unexpected and at worst unpleasant. The correct way, as practiced by experienced programmers, is to follow the DRY principle and define that logic in a single place, such as a function or a method, and then reference that function or method whenever you wish to execute that logic. This provides two enormous benefits:

If you ever need to update the logic, due to a bug fix or an enhancement, you only ever need to change the code in one place. All references to that place will automatically pick up that updated logic.
When writing a new component that needs that logic the programmer does not need to create a new version of that logic, or copy that logic into his code, all he needs to do is call the relevant function or method. This presupposes the fact that the programming team has at hand a list of all the reusable functions in the library so they know the name of each reusable function and what it does.

It should therefore be obvious that the more reusable code you have then the less code you have to write. The less code you have to write then the less time it takes to complete a component. The less code you have to write then the less code you have to test as the reusable code will (should?) have already been tested. The less code you have to write in order to produce a component also means that you save time by not having to write as much code, and as time is money this also leads to lower costs. The ability to produce software in less time and at less cost than your rivals will always be a major factor in a competitive market.

How do you create reusable code?

When I came to start building my first PHP components I already had an architecture in mind which I had encountered in my previous language and which I saw could provide a solid basis for all future development. This was the 3-Tier Architecture with its separate Presentation, Business and Data Access layers. I had also encountered Extensible Markup Language (XML) and The Extensible Stylesheet Language Family (XSL), and I decided that I would create all my HTML pages using XSL Transformations. This meant that I had split my Presentation layer into two separate components thus producing an architecture that included the popular Model-View-Controller (MVC) Design Pattern. This combined architecture is shown in Figure 1 above.

In the following paragraphs I shall refer to various types of component using the names in the above diagram - Models, Views, Controllers and Data Access Objects (DAO). An application will contain a number of each of these component types which can be used in a variety of combinations in order to achieve different results. For example, a single Model may be referenced by any number of Controllers, and the DAO may be referenced by any number of Models. The most important layer is the Business layer, which is also known as the Domain layer, as this contains all the entities/objects and their individual business rules which are relevant to the application. The other components - the Controllers, Views and DAOs - are there as services which support the execution of the business rules.

Note here that I have introduced two categories of object. In his article How to write testable code the author identifies the following categories:

Entities	An object whose job is to hold state and associated behavior. The state (data) can be persisted to and retrieved from a database. Examples of this might be Account, Product or User. In my framework each database table has its own Model class.
Services	An object which performs an operation. It encapsulates an activity but has no encapsulated state (that is, it is stateless). Examples of Services could include a parser, an authenticator, a validator or a transformer (such as transforming raw data into HTML, CSV or PDF). In my framework all Controllers, Views and DAOs are services.
Value objects	An immutable object whose responsibility is mainly holding state but may have some behavior. Examples of Value Objects might be Color, Temperature, Price and Size. PHP does not support value objects, so I do not use them. I have written more on the topic in Value objects are worthless.

The components shown in Figure 1 above have been implemented as follows:

Controllers, Views and DAOs are services, and are application-agnostic.
Models are entities, and are application-specific.

All business rules for an application exist in and only in the Business/Model layer, with a different Model component for each entity which needs to be referenced by the application. There could be hundreds of different Models in a large application.

All the service components are application-agnostic which means that they do not contain any logic or other knowledge which is specific to any application. This means that they can be used with the Model components of any application without any modification. The framework contains sets of pre-written and reusable components for these services as follows:

There are 50 Controller components, one for each of the Transaction Patterns.
There are 3 View components, 1 each for HTML, PDF and CSV output.
There are 4 Data Access Objects, 1 each for MySQL, Postgresql, Oracle and SQL Server.

Step 1 - Encapsulation

You must first create classes before you can make use of inheritance and polymorphism, so what you do here has a direct bearing on how much potential for reusability you will eventually create. Get it wrong and you will have limited potential. Get it right and that potential could be enormous.

Encapsulation is the act of creating a class for something which has data (state) as well as procedures (behaviour) which can operate on that data. The class then acts as the "capsule" for that data and those procedures. In OOP the data is implemented as "properties" and the procedures are implemented as "methods". Please try to avoid falling into the trap of creating anemic objects which contain state but very little behaviour. This is contrary to the basic idea of object-oriented design which is to combine data and process together.

The biggest challenge for the novice programmer is to identify which parts of an application, the "things", which need to be represented as classes, and what methods to build into each of those classes.

RULE #1: what you do *NOT* do is create a single class called "application" with a single method called "run". Instead you identify the different "things" which are of interest to the those areas of the business which are to be handled by the application, and for each of these "things" you create a Model class which will exist in the Business layer. Each Model will have properties and methods, but you then need to create other components to control which methods are called and which which properties are accessed. These other components are called "controllers" for obvious reasons.
RULE #2: do not waste time by trying to design your software components first and your database last otherwise you will hit the problem known as Object-Relational Impedance Mismatch where the structure of the software components is out of sync with the the table structure in the relational database. One solution to this problem is to allow the mismatch but deal with it using an additional software component known as an Object-Relational Mapper (ORM), but to my mind this is just papering over the cracks instead of tackling the problem which is causing the cracks. A much better solution would be NOT to allow the mismatch in the first place, and this can be achieved quite simply by designing your database first, then designing your software around each database component.

Those new to OOP are so dazzled by the idea that OOP "lets you model the real world" that they try to model those objects which they perceive as existing within the real world. When designing something like an e-commerce application which deals with things called PRODUCTS, CUSTOMERS and SALES ORDERS they think it would be a good idea by designing classes for each of those three objects. They are told to leave the database design till later as it is less important, a mere "implementation detail".

It is a well known fact to every experienced database designer that sometimes the data for a single "thing" in the outside world will actually need to be split across more than one table in the database. This is a result of the process called Data Normalisation. For example, a sales order may initially be regarded as a single entity, but in a database it could require multiple tables such as ORDER_HEADER, ORDER_ITEM, ORDER_ADJUSTMENT and ORDER_ITEM_ADJUSTMENT. An experienced programmer would create a separate class for each of these tables whereas a novice would create an aggregate/compound class called ORDER which would handle every table associated with an order. Having a single class which is responsible for more than one database table surely breaks the Single Responsibility Principle (SRP), which is one of the reasons why I avoid that idea like the plague.

Step 2 - Inheritance

Inheritance is the ability to reuse the contents of a base class (superclass) to create a derived class (subclass). This leads to three important questions regarding the amount of code which can be reused through inheritance:

How many superclasses do you have?
How much code is in each superclass?
How many subclasses are there for each superclass?

The method taught to novice OO programmers to identify places where inheritance may be used is to carry out the IS-A test. Unfortunately the poor dears get this completely wrong. They look at an entity called CUSTOMER and say "A customer is-a person, so I must create a PERSON class and then extend this to create a CUSTOMER class". Likewise they say "We sell widgets, and as a widget is-a product I must first create a PRODUCT class and then extend this to create a WIDGET class".

If you follow the same path you will end up with a number of superclasses (PERSON and PRODUCT in the above example) and a number of subclasses (CUSTOMER and WIDGET in the above example). This to me is wrong on so many levels:

You could end up with a large number of superclasses.
The contents of each superclass could be quite small.
Each superclass could be inherited by only a small number of subclasses.

The end result of this approach would be a limited amount of inheritance, which in my opinion is a sign of failure. Not only that, problems can be caused by mis-using inheritance by extending one concrete class to create another concrete class, or by creating deep inheritance hierarchies. The solution to these problems was provided in the book Design Patterns - Elements of Reusable Object-Oriented Software which was first published in October 1994 where it says:

One cure for this is to inherit only from abstract classes since they usually provide little or no implementation.

Any experienced OO programmer should know that when looking to create an abstract class you first look for a group of business/domain entities which share a common set of characteristics. A programmer experienced with SQL will be able to immediately point to the following list of characteristics which are common to all database tables:

Each table has a unique name within its database. Note that an application may access tables across multiple databases. My ERP application, for example, contains over 15 separate domains/subsystems, and each has its own database.
Each table has a structure comprised of one or more columns. Each column name must be unique within its table.
Each column has a data type which is taken from a known list.
Each table has a primary key which is comprised of one or more columns which provides a unique identity for each record in that table.
Each table may have a number of additional unique keys which are known as candidate keys.
Each table may be linked to another table in what is known as a one-to-many or parent-to-child/senior-to-junior relationship. Each relationship is between two tables where the "one" is the "parent/senior" and the "many" is the "child/junior":
- The table may be the parent with many children.
- The table may be the child with many parents.
- In each relationship the child table has a foreign key which contains one or more columns which can be directly linked to the corresponding columns in the primary key of the parent table.
Note that a table may be related to itself, in which case it is both the parent and the child.
Regardless of a table's structure it it subject to exactly the same operations - Create, Read, Update and Delete. Each of these is constructed as a query string which follows a uniform pattern, so is a prime candidate for abstraction.

In the RADICORE framework the abstract table class contains a set of common table methods and a set of common table properties.

Only a novice programmer would fail to see the benefit of placing the code to deal with all those common characteristics in an abstract class which can then be inherited by any number of concrete classes. This also passes the IS-A test as it it quite plain to see that every object in the domain/business layer is-a database table. The abstract class deals with the common characteristics while the concrete class provides the details which are unique to a particular table. But how much code can I put into the abstract class? Quite a lot, actually. This answers a criticism of my approach which was given as long ago as 2003 where someone known as Jochen Daum said the following:

This means you write the same code for each table - select, insert, update, delete again and again. But basically its always the same.

This person was obviously a novice to OO as he failed to understand that instead of repeating the code to deal with the Create, Read, Update and Delete operations in each concrete table class you can define it just once in an abstract table class and then share it with every concrete table class by using inheritance. In my ERP application I currently have over 450 database tables, so if each of those 450 table classes inherits from the same abstract class then that is a lot of code sharing. As the rules for generating and executing SQL queries are exactly the same irrespective of which table you are handling this is covered by a pre-written Data Access Object, so there is no code duplication there.

I disagree with the notion, as contained in the above quote from the Gang of Four book, that abstract classes usually provide little or no implementation. If you have been writing database applications for as long as I have then you may actually find that the code used to communicate with a database table could be quite large. You may wish to intersperse the standard logic which constructs SQL queries with custom logic to handle the business rules, in which case a solution for this is given in the same Gang of Four book in the form of the Template Method Pattern which is described as follows:

Defines the skeleton of an algorithm in an operation, deferring some steps to subclasses. It lets subclasses redefine certain steps of an algorithm without changing the algorithm's structure.

This is where an algorithm/operation requires a series of steps comprised of a number of invariant methods which have concrete implementations defined in the superclass, and variant/variable/customisable "hook" methods which do not have implementations unless they are defined in the subclass. Every subclass then shares the same invariant methods but has its own set of variant/variable/customisable methods. By following the guidelines in the Gang of Four book I have been able to create an abstract table class which contains 220 invariant methods and 150 variant methods. That is a LOT of code which is being shared.

Step 3 - Polymorphism

It is not possible to take advantage of polymorphism unless you have the same method signature appearing in more than one class. These duplicate method signatures may appear by inheriting from the same superclass, but inheritance is not a requirement - it is possible to create several classes where the same method signatures are hard-coded instead of being inherited. How the duplicate methods get there is irrelevant, it is only the fact that they exist which matters. If you have a piece of code in object 'A' which calls a method in object 'B', but the same method is available in objects 'B1' to 'B99', you are then able to call that method in any of the 99 alternative objects. Although the method call is the same the object on which the call is made is different, so the results will be different as each of those 99 objects provides a different implementation of that method.

That explains the mechanics, but where can it be employed? I have already stated several important facts:

Every object in my domain/business layer is a database table which has its own class.
Every concrete table class inherits a great deal of sharable code from the same abstract table class.
Every concrete class therefore automatically contains the methods to perform the Create, Read, Update and Delete (CRUD) operations as these are inherited from the abstract class.

In addition my decades of experience with database applications has taught me the following:

Every task (user transaction) achieves its purpose by performing one or more CRUD operations on one or more database tables where these operations may be interspersed with business rules. Each table will have its own rules, and there may be rules which are specific to a particular task.
Each task may be categorised by its behaviour (what operations it performs) and its content (the objects on which it operates).
When you have written a large number of different tasks on a large number of different database tables you may see, as I have, that some of these tasks have the same behaviour but different content.

Given the above it should be possible to create an object which encapsulates that behaviour and makes the method call on an object whose identity is not known until runtime. If you look at Figure 1 you should see that the methods in each Model (business/domain object) are accessed from a Controller, but unlike most novice programmers who create a separate Controller for each Model where the Model name is hard-coded, in my framework I have Controllers which call a known set of methods on an unknown object using a technique known as Dependency Injection. Each of the tasks in my application has its own component script which is very small as all it does is identify which Model and View are to be used before it hands control over to the Controller. Because the Controller accesses methods which are defined in the abstract table class, and because that same abstract class is inherited by every concrete table class (Model), you should see that the same Controller can be made to work with any Model.

If my framework contains 45 Controllers, and each of these can be used with any of the 400 Models in my ERP application, this means that I have 45 x 400 = 18,000 (yes, EIGHTEEN THOUSAND) opportunities for polymorphism. Is that a lot of reusability or what?

Step 4 - Abstraction

The best description of abstraction that I have ever encountered was found in a paper entitled Designing Reusable Classes which was published in 1988 by Ralph E. Johnson & Brian Foote. In it they describe the process of abstraction as:

separating the abstract from the concrete, the similar from the dissimilar

This is not the same as encapsulation which involves the creation of individual classes as it involves looking at a group of classes after they have been created and looking for similarities and differences. If those classes share a similar set of protocols (operations or methods) then those protocols should instantly be regarded as candidates for being moved to an abstract class where they can be shared by members of that group through inheritance. Note that inheriting from an abstract class is the preferred method as it avoids those problems which can be encountered when inheriting from one concrete class to create a different concrete class.

This is explained in more detail in What is "abstraction"?

An abstract class is distinguishable from a "real" (concrete) class as it cannot be instantiated into an object. It can only be inherited by a concrete subclass which can then be instantiated, thus creating an object which combines the methods within the superclass with those defined within the subclass.

An abstract class may contain either abstract or non-abstract (concrete) methods. Abstract methods cannot contain an implementation and must be supplied in the subclass. Concrete methods may contain an implementation which may or may not be overridden in the subclass. An abstract class is an essential part of the Template Method Pattern as it allows for a series of steps containing a mixture of invariant methods and "hook" methods which can interrupt the processing flow with custom code, where each subclass can have its own custom code.

While inheritance on its own is a valuable technique for code reuse, the Template Method Pattern is an essential part of a framework as it implements the Hollywood Principle (don't call us, we'll call you). The Template Method Pattern is used extensively in the RADICORE framework as every method called by a Controller on a Model is a template method. The standard code defined in the abstract table class handles all the standard processing, but some of these methods have been deliberately left empty so that they do absolutely nothing unless they are overridden in a subclass with code which is specific to that subclass.

My use of an abstract table class, which then allowed me to implement the Template Method Pattern for every user transaction within the application, thus producing enormous quantities of reusable code, came about because of some simple observations:

That every table in a database is a separate entity which requires its own class.
That every table is subject to exactly the same CRUD operations.
That those CRUD operations could be turned into common table methods which could then be defined in an abstract table class so that they could be shared by any number of concrete table classes.
That those methods could be turned into Template Methods so that custom code could be inserted into any subclass by using "hook" methods.
That the use of common table methods, coupled with the use of a single $fieldarray for all application data instead of individual getters and setters, would produce vast amounts of loose coupling throughout the framework, thus contributing to its lower costs of maintenance.
That because each user transaction performs one or more CRUD operations on one or more tables, and these tables use exactly the same methods, the Controllers which call these methods could be made to function with any Model (table) in the application by using dependency injection. This is a by-product of polymorphism which in turn is a by-product of inheritance and loose coupling.

Other ways to write less code

Putting repeating code into reusable components so that it can be called multiple times instead of being duplicated multiple times is one way of writing less code, but there are other techniques which you can employ which allow you to achieve a result with less code. Examples which I use are as follows:

Creating wrapper functions.
When I first started learning PHP using sample code which I found on the internet and in books I noticed that when several steps were required to carry out a task that each step was called separately, such as load(), validate() and store(), which therefore required the same block of code to be repeated in different Controllers. This is a common mistake that can be avoided by placing that repeating group of function calls into a separate wrapper function, thus replacing multiple functions calls with a single call to that wrapper function. This also makes it easier to amend the contents of that wrapper function in the future should a change be made to the number of steps.

The observant among you might also notice that these wrapper functions, when placed inside an abstract class, are a perfect way to implement the Template Method Pattern. This then allows concrete subclasses to insert custom code at various points in the processing flow using "hook" methods.
Avoiding method names which are tightly coupled to particular Models.
Even though classes for database tables share exactly the same CRUD operations I have seen it suggested in numerous places that method names such as createProduct(), createCustomer() and createOrder() should be used. This idea produces tight coupling and destroys all possibility of polymorphism because it ties each Controller to a particular Model. By using a generic insertRecord() method in all table classes it then means that the Controllers can be reused with any Model using that mechanism known as Dependency Injection.
Avoiding moving data into and out of objects one property at a time.
Having separate variables for each table column is not a good idea as it requires a separate line of code to either "get" or "set" each column. This means that any object which uses those getters and setters cannot be shared with objects which do not contain those properties. Non-sharable code automatically produces tight coupling between the two objects which are exchanging data. This destroys polymorphism and with it the ability to use Dependency Injection. As data being sent in from the browser or retrieved from the database is presented as an associative array, it would surely make sense to leave this data in that array instead of having non-unique code to unpack the array into its individual elements so that each element can be transferred individually. Being able to transfer data without using named variables contributes to loose coupling, and this makes the set of common table methods within the abstract table class perfectly valid for every table in the database regardless of what columns they contain.

Creating reusable services

Using encapsulation, inheritance and polymorphism to create entity objects in the business/domain layer is one thing, but is it possible to create reusable service objects in the other layers? Unlike an entity which has state, data which can be accessed via multiple method calls, a service performs a single operation but does not have state, which means that it has to be provided with the necessary data on each method call. It could therefore be possible to create a single service object which can performs its function using any data instead of creating a different object for different sets of data. Below are examples of some of the reusable objects which exist in my framework.

Reusable Data Access Objects

I have seen more than one example on the internet where the novice programmer thinks that it is a good idea to create a different DAO for each table, but that is not how the DAO in the 3-Tier Architecture is supposed to work. In the first language that I used which incorporated a DAO this was an object which could deal with any table in a particular DBMS, which enabled the entire application to be switched from one DBMS to another simply by changing a single component. This behaviour is what I have duplicated in my PHP framework as I have available a separate DAO for each DBMS which I support. I started with MySQL, but later I added PostgreSQL, then Oracle and eventually SQL Server. This is made possible by the fact that each of the methods in the DAO which constructs and then executes the relevant query string includes in its arguments the database name, the table name, and the relevant column names with their values.

Some novice programmers question the idea of having a separate DAO as they think that once an application has been installed with a particular DBMS then it is highly unlikely that the DBMS will be changed. I would largely agree with that sentiment, but what about being able to choose the DBMS before the application is installed? I have developed an open source framework which can be downloaded and used by any team of developers, but I do not restrict it to be used with a single DBMS, I give the developer a choice. I have used the same framework to build a large ERP application as a package which can be used by multiple organisations, and this allows customers to choose the DBMS that they prefer before they install it.

Reusable Views

In a web application all the screens which are shown to the user are constructed in the Presentation layer as HTML documents which conform to a standard which is (or supposed to be) supported by every web browser. Each HTML document is simply a large string of text with pieces of data enclosed in HTML tags. As I had become familiar with the use of XML and XSL in my previous language, where an XML document contains nothing but data while an XSL stylesheet contains a number of templates which can transform that data into HTML, I immediately saw the benefit of employing the same process in my own framework. I thus created a single View component which could extract the data from the Model(s), put that data into an XML document, load in the designated XSL stylesheet, then perform an XSL Transformation to create the HTML output which is then returned to the client's browser.

The View object does not contain any Model names as it functions using Dependency Injection where it is given an array of one or more objects and it calls standard methods on each of those objects to extract whatever data that it/they contain. These standard methods are inherited from the abstract table class, so they do not have to be duplicated in any Model class.

In my original implementation I had a separate XSL stylesheet for each different web page as they each required different columns to appear in different places with different controls. After producing a number of these stylesheets I could see a large amount of code which was similar while the only difference was the list of column names and their HTML controls. After a bit of experimentation I managed to remove the need for a multitude of customised stylesheets and replaced them with a small library of reusable XSL stylesheets. I did this by adding a <structure> element to the XML document which contains enough information to allow the XSL stylesheet to construct the variable application area within each HTML page. The contents of each <structure> element is obtained from a small screen structure script which also identifies the XSL file which is to be used.

By examining a large number of different web pages I could see a series of patterns emerging, and I managed to create a single reusable stylesheet for each pattern. This means that when creating a new task with a web page the developer does not need to spend any time on the standard parts of each page as this is already handled by the library of XSL stylesheets provided by the framework.

Not all tasks produce output which is displayed in a web page. Some use a PDF document, some use a CSV file, while some do not have any output at all. They simply do something without the aid of any dialog with the user before returning control to the component from which they were called. Just as I have a single class which produces all HTML output, I also have single classes for all PDF and CSV output.

Reusable Controllers

The idea of having a small number of reusable Controllers is an impossible dream for novice programmers as the way they are taught to design their applications precludes this possibility. They are taught that when they have identified a task (user transaction) that they should create a class with a method whose name corresponds with that particular task. Using this methodology they end up with method names such as createCustomer(), createProduct(), createOrder() and createShipment(). This is a totally bad idea as it means that in a large application with 3,000 tasks you end up with 3,000 unique method names, and this totally eliminates any possibility for code reuse via polymorphism.

As explained in Step 3 above you need to have identical method signatures appearing in multiple objects in order to provide the opportunity for polymorphism. Once you have done this you can produce components which call one or more of these methods on any of these interchangeable objects using the mechanism of Dependency Injection. As each of the above methods has the end result of inserting a record into a database table you can replace all those Controllers which call those unique methods with a single Controller which calls the generic insertRecord() method on whatever table object is provided at runtime.

By recognising that each of the tasks (user transactions) in my application conforms to a pattern which may be repeated I have been able to create a library of reusable Transaction Patterns each of which has its own Controller script which calls a predetermined set of generic methods on one or more unknown Models (table classes). This means that instead of having a separate Controller for each Model which can only call methods which are unique to that Model I can have a separate Controller for each pattern of behaviour which can be applied to any Model. In this way I can apply the same pattern of behaviour to any table in my database by reusing the pattern instead of writing code to duplicate the behaviour.

I have so far identified and created 45 such Transaction Patterns which between them are used in over 3,000 tasks within my ERP application. Some of these patterns are used hundreds of times, some are used dozens of times while others are used only in rare circumstances.

Generating components

Having code which you can reuse means that when you are writing a new component you can call this reusable code instead of having to duplicate it. In some cases this can make the creation of a new component so simple that it can be automated, which means that instead of writing the code yourself you can have it generated for you by pressing a button. Below are some of the code generation facilities which are included in my framework.

Generating Classes

As pointed out previously my business/domain layer has a separate class for each database table. As a significant portion of the code which deals with the throughput of data from the Presentation layer to the Data Access layer and back again is essentially the same this has allowed me to identify a great deal of code which can be written once and then shared through inheritance from an abstract table class. Experienced database developers will immediately point out that each table has its own business rules, but these can be added in later as variant methods courtesy of my implementation of the Template Method Pattern.

Each table also has it own structure which provides the details for all those common characteristics, and it is these missing details which help to turn an abstract class into a concrete class. My previous language used an internal Data Model which was used to generate the necessary CREATE TABLE scripts, but in my PHP implementation I decided to do the reverse. Instead of maintaining the Data Model and then exporting to the database I import from the database into my version of the Data Model which I now call a Data Dictionary, then I export each table's data from the Data Dictionary into a series of PHP scripts which are then accessed at runtime. I originally created these PHP scripts by hand, but later I wrote a program to do it for me. In my Data Dictionary I built a process to export each table's data to two separate files:

<tablename>.class.inc - the concrete class for each database table. This inherits both the invariant (fixed) and variant (customisable) methods from the superclass. The variant methods are initially empty and only have to be defined in the subclass when an actual implementation is required.
<tablename>.dict.inc - the structure details for that database table which are incorporated into the object via the call to the loadFieldSpec() method in the class constructor.

The reason why I create two separate files is quite simple - after the class file has initially been created it may be amended later on to provide implementations for any of the variant/customisable methods, so this class file is never overwritten during the export process. It is also possible for the table's structure to change after it was first created, so all that is necessary is to re-import the changed structure and re-export the updated details. This will replace the contents of the structure file but not touch the class file.

Generating Tasks

As stated previously each task (user transaction), regardless of its complexity, performs one or more operations on one or more database tables. This is a mixture of standard code which is provided by the framework and custom code to handle the business rules for that table and/or task. By using the Template Method Pattern the standard code is implemented within invariant methods which are defined in the abstract table class while the business rules are implemented within the variant/customisable methods which are defined within each concrete table class. The actual behaviour of each task has been implemented as a series of Transaction Patterns which have been built into the framework.

After generating a class file it is then necessary to create the tasks which allow the user to put data into and get data out of that table. In my framework each task has an entry in the MNU_TASK table in the MENU database which points to a small component script in the file system. I do not have a single task which performs all the possible operations on a table, instead I create a family of forms where each operation is carried out by a separate task. This is for reasons discussed in Component Design - Large and Complex vs. Small and Simple. Again I stated off by doing this all by hand, but later on I wrote another program to do it all for me. Within my Data Dictionary I now have a process which allows me to select a database table, select a Transaction Pattern, then press a button to create all the necessary database records and scripts.

At this point each task is in a basic but runnable state. Although the table class does not yet contain any variant/customisable methods for any specific business rules, it has enough logic to put data into and get data out of the database. Even the validation of user input is taken care of by the built-in validation object which checks all data with the contents of the table's $fieldspec array.

If you have been paying attention you should have noticed that once I have created a database table I can import that table's details into my Data Dictionary, press a button to create the corresponding class file, then press another button to generate the tasks to maintain that table which are then immediately runnable. This entire process can be achieved in 5 minutes without having to write a single line of code - no PHP, no SQL and no HTML. If you cannot achieve the same level of productivity with your methodology then I would suggest that you need to examine your approach and lean how to shift it into a higher gear.

A final area of reusability

Apart from creating reusable code by utilising encapsulation, inheritance and polymorphism, it is also possible to create it by building a library of functions and subroutines. Some libraries can be quite small while others can be quite large, but did you know that there is something which exists at a higher level above a library? In case you are a novice programmer who is still groping around in the dark I shall enlighten you - it is a thing called a framework. If you don't understand the difference between a library and a framework please read What is a Framework?

Instead of having to write your own code to call library functions a framework will create code with basic functionality which you can then alter by extending or overriding the framework code by adding implementations to the variant/customisable methods which are available through my use of the Template Method Pattern. This is the mechanism by which the flow of control is dictated by the framework instead of the caller. The invariant methods in the abstract class are always called, and the empty variant/customisable methods can be overridden in each concrete class to supply additional behaviour.

A proper framework will also contain built-in components to handle that functionality which is common to all the domains/subsystems within the application, for example:

The framework allows a large application to be built from a mixture of separate domains/subsystems where new domains can be added at will.
The login screen is provided by the framework so there is no need to have a separate login for each domain.
An application with a large number of tasks will require a mechanism in which the selection of available tasks can be arranged into a hierarchy of menu pages. The framework will then provide the ability to create menu structures, display them and process user selections without the need to write any code.
Some tasks may be designated as child tasks to a parent task, in which case the child task will be shown on navigation buttons instead of menu buttons, but only when the parent task is active. The framework will then provide the ability to create navigation buttons, display them and process user selections without the need to write any code.
In an application with a large number of tasks which cover a number of different domains, as well as a large number of users, it is not normal practice to allow every user to access every task. The ability to restrict a user's access to a subset of the available tasks is provided by something known as an Access Control List (ACL) or Role Based Access Control (RBAC). The framework provides the ability to add as many different Roles as necessary, to allow Roles to access specified Tasks or Tasks to be accessed by specified Roles, and finally to link Roles to Users. After a user logs in each user will only be able to see, and therefore select, a task to which he has been granted access.
Other security features may be employed, such as:
- Restricting a task to one or more IP addresses.
- Restricting a user to one or more IP addresses.
- Restricting a user to particular hours with the working day.

Summary

The only definition of OOP which I find acceptable goes as follows:

Object Oriented Programming is programming which is oriented around objects, thus taking advantage of Encapsulation, Polymorphism, and Inheritance to increase code reuse and decrease code maintenance.

OOP is supposed to be better than earlier programming paradigms because it provides features which were designed specifically to increase code reuse and decrease code maintenance. The more reusable code you have at your disposal then the less code you have to write. The less code you have to write to get the job done then the less time it will take to get the job done, and getting the job done in less time will make you more productive. However, unless you take advantage of these features and use them appropriately to actually produce more reusable code then your efforts will be wasted.

I built my PHP framework based on 20 years of prior experience with developing database applications, with their associated frameworks, in two different languages. I taught myself to use encapsulation, inheritance and polymorphism without being sidetracked by inappropriate practices. I did not follow these practices simply because I did not know they existed. Instead I used my own experience and intuition to create a framework for writing web applications which contained as many standard and reusable components as possible, and this has made me more productive than I have ever been, and more productive than any of my critics.

The amount of reusable code which is provided by my framework equates to large volumes of code that I DON'T have to write. Examples of this code is as follows:

I do not have to write any Model classes by hand as the framework generates them for me. This is because I recognised the fact that each of the Model objects in my business/domain layer is a table in the database and not some mythical object in the real world. Every database table has a set of characteristics which are common to all database tables, and I have put all the code which deals with these characteristics into an abstract table class which is inherited by every concrete table class. The large amount of code in my abstract table class is therefore a large amount of code which I don't have to write again and again for each concrete table class.
I do not have to write any code to validate the user's input against the data type of the corresponding database columns as the framework does that for me. The contents of the $fieldspec array in the <tablename>.dict.inc file is used by the standard validation object before any data is written to the database.
I do not have to write a separate Controller for each Model as each Model has the same set of methods which are inherited from the abstract table class. Regardless of the purpose a table serves and the data it contains the only operations which can be performed on that table are the ubiquitous Create, Read, Update and Delete. Every task (user transaction) within the application will perform one or more of these operations on one or more database tables, and I have created a set of Transaction Patterns which each deal with a particular combination of operations/methods and number of Model objects. Each of these patterns comes with its own pre-built Controller, so in an application which contains 3,700 tasks each of which utilises a pre-built Controller that is a lot of Controller code that I do not have to write.
I do not have to write any HTML pages by hand as the framework generates them for me at runtime. It does this with a standard component which extracts the data from the Model(s), puts that data into an XML document, then performs an XSL Transformation using the designated XSL stylesheet. This "designated XSL stylesheet" is one of the 12 (yes, TWELVE) stylesheets which exist in my library of reusable XSL stylesheets. My ERP application has over 3,000 forms, so that is a lot of HTML code which I do not have to write.
I do not have to write any code to get a basic task up and running as every task is nothing more than a combination of a reusable Model, a reusable Controller and (sometimes) a reusable View. For each individual task in the application the identity of these components is provided in a component script. I do not even have to write this out by hand as the framework generates it for me. All I do is choose a table, choose a pattern, press a button, then run the task(s). With the basic processing taken care of I can then concentrate on the business rules by adding code into the variant/customisable methods within each Model class.

If writing less code is a laudable aim, then how about the ability to write no code at all? As mentioned above in Generating Tasks I can create a new table in my database, then simply by pressing some buttons I can create and then run the basic tasks to maintain that table in just 5 minutes without writing a single line of code - no PHP, no SQL, no HTML. How much code do you have to write?

Here endeth the lesson. Don't applaud, just throw money.

References

The following articles describe aspects of my framework:

The following articles express my heretical views on the topic of OOP:

These are reasons why I consider some ideas on how to do OOP "properly" to be complete rubbish:

Here are my views on changes to the PHP language and Backwards Compatibility:

The following are responses to criticisms of my methods:

Here are some miscellaneous articles:

Amendment History

12 May 2024	Added Other ways to write less code
09 Dec 2023	Added Abstraction and Step 4 - Abstraction Added Inappropriate rules

counter

Tony Marston's Blog About software development, PHP and OOP

Are you achieving the aims of OOP?