Surgex.Guide.SoftwareDesign (Surgex v5.0.0) View Source
Higher level application design and engineering guidelines.
Link to this section Summary
Functions
Errors should be thrown as close to the spot of failure and unhandled unless required.
Errors from external contexts should be mapped to have a meaning in the current context.
Flow control directives should be leveraged to yield compact and readable code.
Function clauses should be grouped together, ie. without a blank line between them.
Functions should be grouped by their relationship rather than by "public then private".
Usage of import
directive at module level or without the only
option should be avoided.
Non-false moduledoc should be filled only for global, context-external app modules.
Kernel macros for working with nested structures should be preferred over manual assembly.
Functions should not include more than one level of block nesting.
Keyword lists and tuples should be preferred over maps and lists for passing options.
Pattern matching should be preferred over line-by-line destructuring of maps and structs.
Predicate function names shouldn't start with is_
and should end with ?
.
Functions should return :ok
/:error
when both success and failure paths are present.
Sequential variable names, like user1
, should respect underscore naming (and be avoided).
Tests should only use
support test case modules that they need.
The unless
directive should never be used with an else
block or with logical operators.
Matches in a with-else
block should be placed in occurrence order.
Redundant else
block should not be provided for the with
directive.
An else
block should be provided for with
when it forwards cases from external files.
Link to this section Functions
Errors should be thrown as close to the spot of failure and unhandled unless required.
Reasoning
Throwing an exception (or using a throwing equivalent of a standard library function) allows to avoid spending an additional time on inventing failure paths in the code and handling them higher in the call stack.
It may be tempting to go with an error return value, such as the {:error, ...}
tuple, in order
to let the code higher in the call stack to decide what to do in a specific situation, but that
only makes sense when it makes sense, ie. there exists a valid case higher in the call stack that
would want to do something other than throwing or returning a meaningless, generic error.
Otherwise, when a hard system-wide failure ends up not being an exception, it may look like a step towards reusability, but it's really anti-semantic and the specific code unit (function or module) stops telling the whole story, ie. multiple files must be read in order to come up with a simple conclusion that we really end up with an exception anyway.
This may be extra important during a debugging session, since the closer an exception happens to
the spot of failure, the easier it is for developer to understand the real reason behind it. For
the same reason, it's always better to use a throwing equivalent of a standard library function
(with the !
suffix) in places which don't handle the negative scenario anyway further down the
pipe.
Examples
Preferred:
def do_something_external(params) do
required = Keyword.fetch!(params, :required)
optional = Keyword.get(params, :optional)
integer =
params
|> Keyword.fetch!(:integer)
|> String.to_integer
case external_api_call(required, optional, integer) do
%{status: 200, body: body} ->
body["result"]
%{status: error_status, body: error_body} ->
raise("External API error #{error_status}: #{inspect error_body}")
end
end
Bad code (read the explanation below):
def do_something_external(params) do
required = Keyword.get(params, :required)
optional = Keyword.get(params, :optional)
{integer, _} =
params
|> Keyword.fetch!(:integer)
|> Integer.parse
case external_api_call(required, optional, integer) do
%{status: 200, body: body} ->
{:ok, body["result"]}
_ ->
{:error, :external_api_failed}
end
end
There are following problems in the code above:
not throwing on forgotten
:required_option
as early as possible will yield problems further down the pipe that will be hard to debug since debugging session will have to track the issue back to the original spot that we could've thrown at since the beginningnot using optimal standard library means for throwing a descriptive error for failed string to integer conversion (
String.to_integer
) will yield a less descriptive match error (and the match-all on a 2nd elem of tuple fromInteger.parse
may produce bugs)returning
{:error, :external_api_failed}
on failure from external API will force the caller ofdo_something_external
to handle this case, so it makes sense only if we can actually do something that makes sense (other than raising, silencing the issue or making it ambiguous)
Errors from external contexts should be mapped to have a meaning in the current context.
Reasoning
Elixir allows to match and forward everything in case
and with-else
match clauses (which are
often used to control the high level application flow) or to simply omit else
for with
. This
often results in bubbling up errors, such as those in {:error, reason}
tuples, to the next
context in which those errors are ambiquous or not fitting the context into which they traverse.
For instance, {:error, :forbidden}
returned from a HTTP client is ambiguous and not fitting the
context of a service or controller that calls it. The following questions are unanswered:
- what exactly is forbidden?
- why would I care if it's forbidden and not, for instance, temporarily unavailable?
- what actually went wrong?
- how does it map to actual input args?
A reverse case is also possible when errors in lower contexts are intentionally named to match
upper context expectations, breaking the separation of concerns. For instance, a service may
return {:error, :not_found}
or {:error, :forbidden}
in order to implicitly fall into fallback
controller's expectations, even though a more descriptive error naming could've been invented.
Therefore, care should be put into naming errors in a way that matters in the contexts where
they're born and into leveraging case
and with-else
constructs to re-map ambiguous or not
fitting errors into a meaningful and fitting ones when they travel across context bounds.
Examples
Preferred:
defmodule RegistrationService do
def call(attrs) do
with {:ok, user} <- CreateUserFromAttributesService.call(attrs),
:ok <- SendUserWelcomeEmailService.call(user)
do
{:ok, user}
else
{:error, changeset = %Ecto.Changeset{}} -> {:error, :invalid_attributes, changeset}
{:error, :not_available} -> {:error, :mailing_service_not_available}
end
end
end
Ambiguous and "out of context" errors:
defmodule RegistrationService do
def call(attrs) do
with {:ok, user} <- CreateUserFromAttributesService.call(attrs),
:ok <- SendUserWelcomeEmailService.call(user)
do
{:ok, user}
else
{:error, changeset = %Ecto.Changeset{}} -> {:error, changeset}
{:error, :not_available} -> {:error, :not_available}
end
end
end
Flow control directives should be leveraged to yield compact and readable code.
Reasoning
Each of flow control directives (if
, cond
, case
, with
) has its own purpose, but sometimes
more than one of them can be used to achieve the same goal. In such cases, the one that yields the
most compact and readable code should be picked.
Examples
Preferred:
with {:ok, user} <- load_user(id),
{:ok, avatar} <- load_user_avatar(user)
do
{:ok, user, avatar}
end
Redundant case
equivalent of the above:
case load_user(id) do
{:ok, user} ->
case load_user_avatar(user) do
{:ok, avatar} ->
{:ok, user, avatar}
error -> error
end
error -> error
end
Function clauses should be grouped together, ie. without a blank line between them.
Reasoning
This allows to easily read a whole set of specific function's clauses and spot the start and end of the whole story of that specific function.
Examples
Preferred:
def active?(%User{confirmed_at: nil}), do: false
def active?(%User{}), do: true
def deleted?(%User{deleted_at: nil}), do: false
def deleted?(%User{}), do: true
No obvious visual bounds for each function:
def active?(%User{confirmed_at: nil}), do: false
def active?(%User{}), do: true
def deleted?(%User{deleted_at: nil}), do: false
def deleted?(%User{}), do: true
Functions should be grouped by their relationship rather than by "public then private".
Reasoning
The existence of a def
+ defp
directive pair allows to leave behind the old habits for
defining all the public functions before private ones. Keeping related functions next to each
other allows to read the code faster and to easily get the grasp of the whole module flow.
The best rule of thumb is to place every private function directly below first other function that calls it.
Examples
Preferred:
def a, do: b()
defp a_helper, do: nil
def b, do: nil
defp b_helper, do: nil
Harder to read:
def a, do: b()
def b, do: nil
defp a_helper, do: nil
defp b_helper, do: nil
Usage of import
directive at module level or without the only
option should be avoided.
Reasoning
When importing at module level, one adds a set of foreign functions to the module that may conflict with existing ones. This gets worse when multiple modules are imported and their names start to clash with each other. When project complexity increases over time and the preference for imports over aliases grows, the developer will sooner or later be forced to name functions in a custom to-be-imported module in a way that scopes them in a target module and/or avoids naming conflicts with other to-be-imported modules. This results in bad function naming - names start to be unnecessarily long or to repeat the module name in a function name.
When importing without the only
option, it's unclear without visiting the source of imported
module what exact function names and arities come from the external place. This makes the code
harder to reason about.
Examples
Preferred:
defmodule User do
def full_name(%{first_name: first_name, last_name: last_name}) do
import Enum, only: [join: 2]
join([first_name, last_name])
end
end
Too wide scope:
defmodule User do
import Enum, only: [join: 2]
def full_name(%{first_name: first_name, last_name: last_name}) do
join([first_name, last_name])
end
end
Unknown imports:
defmodule User do
def full_name(%{first_name: first_name, last_name: last_name}) do
import Enum
join([first_name, last_name])
end
end
Non-false moduledoc should be filled only for global, context-external app modules.
Reasoning
Filling moduledoc results in adding the module to module list in the documentation. Therefore, it makes little sense to use it only to leave a comment about internal mechanics of specific module or its meaning in the context of a closed application domain. For such cases, regular comments should be used. This will yield a clean documentation with eagle-eye overview of the system and its parts that can be directly used from global or external perspective.
Example
Preferred:
defmodule MyProject.Accounts do
@moduledoc """
Account management system.
"""
@doc """
Registers an user account.
"""
def register(attrs) do
MyProject.Accounts.RegistrationService.call(attrs)
end
end
defmodule MyProject.Accounts.RegistrationService do
@moduledoc false
# Fails on occasion due to Postgres connection issue.
# Works best on Fridays.
def call(attrs) do
# ...
end
end
Unnecessary external-ization and comment duplication:
defmodule MyProject.Accounts do
@moduledoc """
Account management system.
"""
@doc """
Registers an user account.
"""
def register(attrs) do
MyProject.Accounts.RegistrationService.call(attrs)
end
end
defmodule MyProject.Accounts.RegistrationService do
@moduledoc """
Registers an user account.
Fails on occasion due to Postgres connection issue.
Works best on Fridays.
"""
def call(attrs) do
# ...
end
end
Kernel macros for working with nested structures should be preferred over manual assembly.
This is about macros from the *_in
family in the Elixir.Kernel
module, like pop_in
,
put_in
or update_in
.
Reasoning
Using these macros can vastly reduce the amount of code amd ensure that the complexity of digging and modifying nested structures is handled in the fastest way possible, as guaranteed by relying on a standard library. Implementing these flows manually leads to repetitive code and an open door for extra bugs.
Examples
Preferred:
opts = [
user: [
name: "John",
email: "user#xample.com"
]
]
opts_with_phone = put_in opts[:user][:phone], "+48 600 700 800"
Unneeded complexity:
opts = [
user: [
name: "John",
email: "user#xample.com"
]
]
user_with_phone = Keyword.put(opts[:user], :phone, "+48 600 700 800")
opts_with_phone = Keyword.put(opts, :user, user_with_phone)
Functions should not include more than one level of block nesting.
Reasoning
Constructs like with
, case
, cond
, if
or fn
often need their own vertical space in order
to make them readable, avoid cluttering and explicitly express dependencies needed by each block.
Therefore, if they appear within each other, it should be preferred to extract the nested logic to
separate function. This will often yield a good chance to replace some of these constructs with
preferred solution of pattern matching function arguments.
Examples
Preferred:
def calculate_total_cart_price(cart, items_key \\ :items, omit_below \\ 0) do
reduce_cart_items_price(cart[items_key], omit_below)
end
defp sum_cart_items_price(nil, _omit_below), do: 0
defp sum_cart_items_price(items, omit_below) do
Enum.reduce(items, 0, &reduce_cart_item_price(&1, &2, omit_below))
end
defp reduce_cart_item_price(%{price: price}, total, omit_below) when price < omit_below do
total
end
defp reduce_cart_item_price(%{price: price}, total, _omit_below) do
total + price
end
Cluttered and without obvious variable dependencies (items_key
is not used in the deepest block
while omit_below
is):
def calculate_total_cart_price(cart, items_key \\ :items, omit_below \\ 0) do
if cart[items_key] do
Enum.reduce(cart[items_key], 0, fn %{price: price}, total ->
if price < omit_below do
total
else
total + price
end
end)
else
0
end
end
Keyword lists and tuples should be preferred over maps and lists for passing options.
Reasoning
Keyword lists and tuples are a standard, conventional means for passing internal information between Elixir modules.
Keyword lists enforce a usage of atoms for keys and allow to pass single key more than once and in
specific order when that's desired (and provide a merge
function for when that's not desired).
The price for last two feats is that they are not pattern-matchable (and should never be pattern
matched) in cases when order and duplication is not important - functions from the
Elixir.Keyword
module should be used in those cases. Ot the other hand, pattern matching may
come handy when parsing options with significant order of keys.
Tuples declare a syntax for short, efficient, predefined lists and are useful in simpler and
convention-driven cases, in which key naming is not needed. For instance, there's an established
convention to return {:ok, result}
/{:error, reason}
tuples from actions that can succeed or
fail without throwing.
Examples
Preferred:
defp create_user(attrs, opts \\ []) do
# required option
auth_scope = Keyword.fetch!(opts, :send_welcome_email, false)
# options with defaults
send_welcome_email = Keyword.get(opts, :send_welcome_email, false)
mark_as_confirmed = Keyword.get(opts, :mark_as_confirmed, true)
case Repo.insert(%User{}, attrs) do
{:ok, user} ->
final_user =
user
|> send_email(send_welcome_email)
|> confirm(mark_as_confirmed)
{:ok, final_user}
{:error, changeset} ->
{:error, map_changeset_errors_to_error_reason(changeset.errors)}
end
end
Invalid usage of maps over keyword lists:
defp create_user(attrs, opts = %{}) do
# ...
end
Invalid usage of lists over tuples:
defp create_user(attrs) do
# ...
[:ok, user]
end
Pattern matching should be preferred over line-by-line destructuring of maps and structs.
Reasoning
Pattern matching can be used to vastly simplify destructuring of complicated structures, so it
should be used whenever possible, instead of taking out field by field via a struct getter (.
)
or an access operator ([]
).
It's supported in function clauses, so extensive use of the feature will also encourage writing more pattern-matched functions, which should in turn yield a code easier to parse for Elixir developers. Function headers with long matches can be easily broken into multiple lines and indented in a clean way, so the length of a match should not be the factor for making a decision about using or not using it.
Even outside of function clauses, pattern matching is a blazing fast VM-supported feature that, combined with guards unwrapped at compilation time, should yield the best possible code performance.
It's also worth mentioning that pattern matching can be also done inside of the assert
macro
in ExUnit
in order to write selective, nicely diffed assertions on maps and structs.
Pattern matching should not be preferred over functions from Keyword
module for destructuring
option lists, even if they can hold only one possible option at a time.
Examples
Preferred in function clauses:
def create_user_from_json_api_document(%{
"data" => %{
"id" => id,
"attributes" => %{
"name" => name,
"email" => email,
"phone" => phone
}
}
}, mailing_enabled) do
user = insert_user(id, name, email, phone)
if mailing_enabled, do: send_welcome_email(user)
end
Preferred in tests:
assert %User{
name: "John",
phone: "+48 600 700 800"
} == CreateUserAction(name: "John", email: email_sequence(), phone: "+48 600 700 800")
Cluttered:
id = doc["data"]["id"]
name = doc["data"]["attributes"]["name"]
email = doc["data"]["attributes"]["email"]
phone = doc["data"]["attributes"]["phone"]
Predicate function names shouldn't start with is_
and should end with ?
.
Reasoning
It's an Elixir convention to name predicate functions with a ?
suffix. It leverages the fact
that this character can appear as function name suffix to make it easier to differentiate such
functions from others.
It's also an Elixir convention not to name predicate functions with a is_
prefix, since that
prefix is reserved for guard-enabled predicate macros.
Note that this rule doesn't apply to service functions that return success tuples instead of plain boolean values.
Examples
Preferred:
def active?(user), do: true
Function that pretends to be a guard:
def is_active?(user), do: true
Function that pretends not to be a predicate:
def active(user), do: true
Functions should return :ok
/:error
when both success and failure paths are present.
Reasoning
First of all, we do want to adhere to the long-standing Elixir convention of returning
:ok
/:error
atoms from functions. They may either be stand-alone (simple :ok
/:error
when
there's nothing more to add) or wrapped in a tuple with extra contextual info, such as {:ok, fetched_data}
or {:error, reason}
. Tuples may be mixed with stand-alone atoms, eg. the same
function may return :ok
upon success (since there's nothing more to add upon success) while
multiple distinct error paths may return {:error, reason}
to make them distinct to the caller.
That said, there's a case when usage of this pattern may make the code more confusing. It's when
specific code simply cannot fail. If it cannot fail, then it doesn't make sense to make it tell
its caller that something went ok. In such cases, the function should simply return the value that
was asked for (fetched_data
in example above) or nil
if there's nothing to return (eg. when a
non-failing function only creates side effects).
This fits nicely into the way the Elixir standard library is designed (eg. Map.get/2
never fails
so it only returns the value but Map.fetch/2
does fail so it returns {:ok, value}
or
:error
). As such, this rule makes our code consistent with Elixir conventions and community code
that's supposed to follow them.
Refer to the
Surgex.Guide.SoftwareDesign.error_handling/0
rule in order to learn when to actually implement the failure path.
Examples
Preferred:
def print_debug_info(message) do
IO.puts(message)
nil
end
def remove_file(path) do
if File.exists?(path)
:ok = File.rm(path)
else
{:error, :file_not_found}
end
end
Confusing :ok
when there's no failure path (IO.puts/1
returns :ok
):
def print_debug_info(message) do
IO.puts(message)
end
def remove_file(path) do
if File.exists?(path)
:ok = File.rm(path)
else
raise("No such file: #{inspect path}")
end
end
Lack of :ok
when there's a failure path (File.read!/1
returns the file content):
def read_file(path) do
if File.exists?(path)
File.read!(path)
else
{:error, :file_not_found}
end
end
Sequential variable names, like user1
, should respect underscore naming (and be avoided).
Reasoning
Sequential variable names should be picked only as a last resort, since they're hard to express
in underscore notation and are non-descriptive. For instance, in comparison function
compare(integer_1, integer_2)
can be replaced with compare(integer, other_integer)
.
Sequence number added as suffix without the underscore is a breakage of underscore naming and
looks especially bad when the name consists of more than one word, like user_location1
.
Examples
Preferred:
def compare(integer, other_integer), do: # ...
Preferred as last resort:
def add_three_nums(integer_1, integer_2, integer_3), do: # ...
Plain ugly:
def concat(file_name1, file_name2), do: # ...
Tests should only use
support test case modules that they need.
Reasoning
If specific test only unit tests a module without using a web request, it shouldn't use ConnCase
and if it doesn't create records, it shouldn't use DataCase
. For many tests, ExUnit.Case
will
be enough of a support.
This yields more semantic test headers and avoids needlessly importing and abusing of more complex support files.
Examples
Preferred:
defmodule MyProject.Web.MyControllerTest do
use MyProject.Web.ConnCase
end
defmodule MyProject.MyServiceTest do
use MyProject.DataCase
end
defmodule NeitherControllerNorDatabaseTest do
use ExUnit.Case
end
Test support file abuse:
defmodule MyProject.MyServiceTest do
use MyProject.Web.ConnCase
end
defmodule NeitherControllerNorDatabaseTest do
use MyProject.DataCase
end
The unless
directive should never be used with an else
block or with logical operators.
Reasoning
The unless
directive is confusing and hard to reason about when used with more complex
conditions or an alternative code path (which could be read as "unless unless"). Therefore, in
such cases it should be rewritten as an if
.
Examples
Preferred:
unless user.confirmed, do: raise("user is not confirmed")
if user.banned and not(user.vip) do
raise("user is banned")
else
confirm_action(user)
end
Too hard to read:
unless not(user.banned) or user.vip do
confirm_action(user)
else
raise("user is banned")
end
Matches in a with-else
block should be placed in occurrence order.
Reasoning
Doing this will make it much easier to reason about the whole flow of the with
block, which
tends to be quite complex and a core of flow control.
Examples
Preferred:
defmodule RegistrationService do
def call(attrs) do
with {:ok, user} <- CreateUserFromAttributesService.call(attrs),
:ok <- SendUserWelcomeEmailService.call(user)
do
{:ok, user}
else
{:error, changeset = %Ecto.Changeset{}} -> {:error, changeset}
{:error, :not_available} -> {:error, :not_available}
end
end
end
Unclear flow:
defmodule RegistrationService do
def call(attrs) do
with {:ok, user} <- CreateUserFromAttributesService.call(attrs),
:ok <- SendUserWelcomeEmailService.call(user)
do
{:ok, user}
else
{:error, :not_available} -> {:error, :not_available}
{:error, changeset = %Ecto.Changeset{}} -> {:error, changeset}
end
end
end
Redundant else
block should not be provided for the with
directive.
Reasoning
In cases when all the code called in with
resides in the same file (or in a standard library)
and when none of else
clauses would override the negative path's output, it's more semantic and
descriptive to simply drop the else
entirely. It's worth noting that else
blocks in with
bring an additional maintenance cost so it should be excused by either of conditions mentioned
above.
Examples
Preferred:
defmodule RegistrationService do
def call(attrs) do
with {:ok, user} <- insert_user(attrs),
:ok <- send_welcome_email(user)
do
{:ok, user}
end
end
defp insert_user(attrs), do: # ...
defp send_welcome_email(user), do: # ...
end
Redundant and hard to maintain else
:
defmodule RegistrationService do
def call(attrs) do
with {:ok, user} <- insert_user(attrs),
:ok <- send_welcome_email(user)
do
{:ok, user}
else
{:error, :insertion_error_a} -> {:error, :insertion_error_a}
{:error, :insertion_error_b} -> {:error, :insertion_error_b}
{:error, :insertion_error_c} -> {:error, :insertion_error_c}
{:error, :mailing_service_error_a} -> {:error, :mailing_service_error_a}
{:error, :mailing_service_error_b} -> {:error, :mailing_service_error_b}
{:error, :mailing_service_error_c} -> {:error, :mailing_service_error_c}
end
end
defp insert_user(attrs), do: # ...
defp send_welcome_email(user), do: # ...
end
An else
block should be provided for with
when it forwards cases from external files.
Reasoning
The with
clause allows to omit else
entirely if its only purpose is to amend the specific
series of matches filled between with
and do
. In such cases, all non-matching outputs are
forwarded (or "bubbled up") by with
. This is a cool feature that allows to reduce the amount of
redundant negative matches when there's no need to amend them.
It may however become a readability and maintenance problem when with
calls to complex, external
code from separate files, which makes it hard to reason about the complete set of possible
outcomes of the whole with
block. Therefore, it's encouraged to provide an else
which lists
a complete set of possible negative scenarios, even if they are not mapped to a different output.
Examples
Preferred:
defmodule RegistrationService do
def call(attrs) do
with {:ok, user} <- CreateUserFromAttributesService.call(attrs),
:ok <- SendUserWelcomeEmailService.call(user)
do
{:ok, user}
else
{:error, changeset = %Ecto.Changeset{}} -> {:error, changeset}
{:error, :not_available} -> {:error, :not_available}
end
end
end
Unclear cross-module flow:
defmodule RegistrationService do
def call(attrs) do
with {:ok, user} <- CreateUserFromAttributesService.call(attrs),
:ok <- SendUserWelcomeEmailService.call(user)
do
{:ok, user}
end
end
end