Thoughts on code reviews and the development process
These days, most software projects set the master branch to be read-only, use pull requests to merge feature branches, and require code reviews with multiple approvals. Theoretically, this is a great practice, no doubt; but, as always, the devil is in the details, and typical code reviews are frequently ineffective or even harmful.
- Most standard code review tools display all changes in a flat list, so the context of these changes is completely lost. The order in which files appear has nothing to do with the logical organization of the change (or the codebase in general).
- It is extremely hard to reason about changes larger than just a few files with more than just 10-20 added/modified lines of code in each, especially when these changes mix design concepts and low-level implementations of particular functions. It requires a very deep context switch and extreme discipline from the reviewer.
- As a result, many reviews focus only on very simple things. Things that could and should be discovered by code analysis tools, IDE inspections, SonarQube, and other automatic quality gates. The reviewer’s time is expensive, and code analysis tools are cheap.
- When PR is submitted for review, the code is already written. It is often too late to comment on the fundamental problems in the design. Such comments are hard to address, and naturally, they are met with lots of resistance and even anger from the code author.
- Often there is a tight deadline to deliver the feature, so the pressure goes on the reviewer: “It is his fault we didn’t deliver the fix on time, he didn’t timely approve my code change”.
- Typically, the author of the PR selects the reviewers. This leads to an uneven distribution of the reviews (which consume significant time), or sometimes it promotes selecting reviewers that “go easy on everyone”.
What’s the solution?
I propose using the following process:
- The developer starts working on the assigned feature/ticket, e.g., TICKET-123.
- Create and check out the feature branch: feature/t123/my-feature.
- Analyze the feature, and if it is complex enough, write a design document. Depending on the complexity, this could be just a few sentences in the MD file or a full wiki page with diagrams, etc.
- Create another branch feature/t123/my-feature-design, commit your design document (or link to it) there, and submit the PR with the overall feature/t123/my-feature as a target. Some projects might require that only tech leads or architects review such “design” PRs.
- Once “Design PR” is approved, merge it and create another branch feature/t123/my-feature-blueprint. The purpose of this branch is to create the overall skeleton of the feature, adding lots of placeholders for every implementation detail. The more placeholders you add, the better it is. Each such placeholder becomes a subtask of the feature. These placeholders might use some special markers, like // PH-1 User input validation, and your IDE can be easily configured to display the consolidated list of such markers. After this is done, submit the PR to merge feature/t123/my-feature-blueprint into feature/t123/my-feature. Here, your reviewers will have an opportunity to evaluate the overall feature design at the code level and express their concerns or suggestions.
- Next, implement every placeholder defined in the previous step. Each placeholder is implemented in a separate branch that is forked from feature/t123/my-feature, and a dedicated PR is submitted for each. Note that at this stage, “placeholder” branches could be implemented in parallel by several developers. Ideally, reviewers of these placeholder implementation PRs should be assigned randomly, promoting a balanced workload as well as knowledge sharing within the development team.
- Once all placeholders are resolved, submit the final PR merging the overall feature/t123/my-feature into the main branch. Since every change in this PR has already been reviewed and approved, the goal of this PR is to simply verify the commit history of the feature/t123/my-feature branch and make sure no direct code commits are done.
- Finally, merge the feature branch into the master.
This process has many advantages. First of all, reviewers have a chance to comment on feature design before too many resources are spent on implementation. Secondly, instead of one huge PR, we are dealing with several small PRs, each one staying at a single level of abstraction and never mixing the overall design with the small details of a particular step. It is easy to understand and provide meaningful comments for such PRs. It promotes knowledge sharing in the team and minimizes frustration and conflicts when some big PRs are rejected.
As usual, comments, opinions, and criticism are more than welcome!