life is too short for a diary




A Philosophy of Software Design by John Ousterhout

Tags: books

Author
Written by: Tushar Sharma
Featured image for A Philosophy of Software Design by John Ousterhout
  These are my rough notes while reading this book


In the Waterfall model, development is a linear process with distinct phases: requirements, design, implementation, testing, and deployment. Each phase must be completed before the next begins. This rigid approach can lead to challenges in adapting to changing requirements or unexpected issues.

Agile methodologies, like Scrum or Kanban, emphasize iterative development. A project is broken down into smaller, manageable iterations or sprints. Each sprint involves planning, designing, implementing, and testing a specific set of features. This iterative approach allows for flexibility and continuous improvement.

Modular design is an approach where a software system is divided into distinct, relatively independent modules. Each module has two components: an interface and an implementation. The interface specifies what the module does without detailing how it accomplishes it. The implementation contains the code that fulfills the promises made by the interface.

In object-oriented programming (OOP), each class can be viewed as a module. By adhering to the interface, changes to a module's implementation do not impact other modules that depend on it. This isolation enhances flexibility and maintainability across the system.

Abstraction provides a simplified view of an entity, focusing only on essential details while omitting irrelevant information. This principle allows developers to manage complexity effectively.

An interface is like a contract that specifies methods which must be implemented by any class that adheres to it. It exposes these methods to other modules without revealing the actual implementation, ensuring that interactions occur through a consistent set of behaviors rather than direct access to the implementation."

The best modules are deep: they have a lot of functionality hidden behind a simple interface. Shallow module is one whose interface is complicated relative to the functionality it provides.

Often taught that "classes should be small". Instead classes should be deep. Small classes don't contribute much functionality and increase boilerplate. e.g.

FileInputStream fileStream = new FileInputStream(filename);
BufferedInputStream bufferedStream = new BufferedInputStream(fileStream);
ObjectInputStream objectStream = enw ObjectInputStream(bufferedStream);

In this code, FileInputStream only provides rudimentary I/O: it's not capable of bueffered I/O, nor can it read or write searialized objects. BufferedInputStream adds buffering to FileInputStream and ObjectInputStream adds read ability to read and write serialized objects. First two objects, fileStream and bufferedStream are never used once the file has been opened. So buffering must be requested explicitly. If a developer forgest to creat this object, there will be no bufferring and I/O will be slow.

Java Developers argue that since not everyone wants to use buffering for file I/O, so that's why there's an additional step. However interfaces should be desigend to make common case as simple as possible. In contrast, Unix develoeprs made sequential I/O most common use case.

A pass-through method is one that does little except invoke another method, whose signature is similar or identical to that of the calling method. They make the class more shallow, increase the complexity without increasing the functionality of system. However, one such exception like dispatcher. A dispatcher is a method that uses its arguments to select one of several other methods to invoke.

Decorators. Also called wrapper. A decorator objct takes an existing object and extends its functionality. In Java, BufferedInputStream class is a decorator: given an InputStream object, it provides the same API but intorduces bufering.

A pass-through variable is a variable that has been passed through long chain of methods. It increases complexity for intermediate methods. One Soulution - Use Global Variable. However it has it's cons. It's impossible to create independent instances of same system in same process that access same global variable (conflict). ANother solution - Use Context object. It stores all the glboal variable. Cons is that it can have thread safety issue. Solution use immutable objects in a context object.

Subdivision: Breaking up classes to divide work. However too many classes will increase complexity if there are shallow. Also subdivision works great if classes are independent. If there is dependencies between them, it would be to merge them into single class. e.g. HTTP request reader and parsing it into String would be better to have a single class.

GOTO are bad programming practice because unstructured jumps makes it hard to understand the program.

How to Split a method :

Exception Handling::

Self-Documenting Code is discouraged. Good code doesn't repeat the code. eg

// Bad comment
// Calculate the total price with tax
double total = price + (price * taxRate);

// Better: Make the code self-explanatory
double totalPriceWithTax = price + (price * taxRate);

However comments are useful eg.

// Using binary search because the dataset is pre-sorted and performance is critical
int index = binarySearch(array, target);

// Workaround for a bug in library version 1.2.3 that causes null pointer exceptions
if (value != null) {
    process(value);
}

Avoid Repeating the Code:

Comments should add value by providing context or intent, not restating what the code does. Bad Example (Repeats the code):

// Setting the status to LOADING
readRPC[i].status = LOADING;

Good Example (Explains the purpose):

// Indicate that the RPC is currently being processed
readRPC[i].status = LOADING;

Provide High-Level Information:

Comments should explain the intent or purpose of the code rather than its specific implementation details.

Ask Yourself: What is this code trying to achieve?

// Append the current hash to the RPC list for the appropriate server if it hasn’t been processed yet
if (!processedHashes.contains(currentHash)) {
    appendToRPC(currentHash, server);
}

Abstraction in Comments:

Provide a simplified view of the entity or process. Focus on essential details and omit those that can be safely ignored. Bad Example (Too Detailed):

// Check if the hash is already in the list of processed hashes, 
// then find the appropriate server using the server map, 
// and finally append the hash to the RPC queue of that server.
if (!processedHashes.contains(currentHash)) {
    appendToRPC(currentHash, server);
}

Good Example (Abstracted):

// Ensure the current hash is processed by the appropriate server
if (!processedHashes.contains(currentHash)) {
    appendToRPC(currentHash, server);
}

Use Comments for Context:

Comments should clarify the why and not just the what of the code. Example:

// Avoid duplicate processing by checking if the hash has already been handled
if (!processedHashes.contains(currentHash)) {
    appendToRPC(currentHash, server);
}

Interface documentation shouldn't have implementation details.

Comments first before writing the code. Also comments are more like abstraction rather than repeting the code

// phase 1 : scan active RPC to see if call is completed

Local Variables should be named precise. Generic names are not good. e.g.

Invariant is a property of variable or structure that's always true.

Generic container are non obvious code. e.g.

return new Pair<Intger, Boolean>();

It's much better to wrap it under a class and give it a meaningful name.

If there's a bug in the code, first write a unit test and then fix the test.


comments powered by Disqus