ansible/docs/docsite/rst/user_guide/playbooks_error_handling.rst

Error Handling In Playbooks
===========================

.. contents:: Topics

Ansible normally has defaults that make sure to check the return codes of commands and modules and
it fails fast -- forcing an error to be dealt with unless you decide otherwise.

Sometimes a command that returns different than 0 isn't an error.  Sometimes a command might not always
need to report that it 'changed' the remote system.  This section describes how to change
the default behavior of Ansible for certain tasks so output and error handling behavior is
as desired.

.. _ignoring_failed_commands:

Ignoring Failed Commands
````````````````````````

Generally playbooks will stop executing any more steps on a host that has a task fail.
Sometimes, though, you want to continue on.  To do so, write a task that looks like this::

    - name: this will not be counted as a failure
      command: /bin/false
      ignore_errors: yes

Note that the above system only governs the return value of failure of the particular task,
so if you have an undefined variable used or a syntax error, it will still raise an error that users will need to address.
Note that this will not prevent failures on connection or execution issues.
This feature only works when the task must be able to run and return a value of 'failed'.

.. _resetting_unreachable:

Resetting Unreachable Hosts
```````````````````````````

.. versionadded:: 2.2

Connection failures set hosts as 'UNREACHABLE', which will remove them from the list of active hosts for the run.
To recover from these issues you can use `meta: clear_host_errors` to have all currently flagged hosts reactivated,
so subsequent tasks can try to use them again.


.. _handlers_and_failure:

Handlers and Failure
````````````````````

When a task fails on a host, handlers which were previously notified
will *not* be run on that host. This can lead to cases where an unrelated failure
can leave a host in an unexpected state. For example, a task could update
a configuration file and notify a handler to restart some service. If a
task later on in the same play fails, the service will not be restarted despite
the configuration change.

You can change this behavior with the ``--force-handlers`` command-line option,
or by including ``force_handlers: True`` in a play, or ``force_handlers = True``
in ansible.cfg. When handlers are forced, they will run when notified even
if a task fails on that host. (Note that certain errors could still prevent
the handler from running, such as a host becoming unreachable.)

.. _controlling_what_defines_failure:

Controlling What Defines Failure
````````````````````````````````

Suppose the error code of a command is meaningless and to tell if there
is a failure what really matters is the output of the command, for instance
if the string "FAILED" is in the output.

Ansible provides a way to specify this behavior as follows::

    - name: Fail task when the command error output prints FAILED
      command: /usr/bin/example-command -x -y -z
      register: command_result
      failed_when: "'FAILED' in command_result.stderr"

or based on the return code::

    - name: Fail task when both files are identical
      raw: diff foo/file1 bar/file2
      register: diff_cmd
      failed_when: diff_cmd.rc == 0 or diff_cmd.rc >= 2

In previous version of Ansible, this can still be accomplished as follows::

    - name: this command prints FAILED when it fails
      command: /usr/bin/example-command -x -y -z
      register: command_result
      ignore_errors: True

    - name: fail the play if the previous command did not succeed
      fail:
        msg: "the command failed"
      when: "'FAILED' in command_result.stderr"

.. _override_the_changed_result:

Overriding The Changed Result
`````````````````````````````

When a shell/command or other module runs it will typically report
"changed" status based on whether it thinks it affected machine state.

Sometimes you will know, based on the return code
or output that it did not make any changes, and wish to override
the "changed" result such that it does not appear in report output or
does not cause handlers to fire::

    tasks:

      - shell: /usr/bin/billybass --mode="take me to the river"
        register: bass_result
        changed_when: "bass_result.rc != 2"

      # this will never report 'changed' status
      - shell: wall 'beep'
        changed_when: False

Aborting the play
`````````````````

Sometimes it's desirable to abort the entire play on failure, not just skip remaining tasks for a host.

The ``any_errors_fatal`` play option will mark all hosts as failed if any fails, causing an immediate abort::

     - hosts: somehosts
       any_errors_fatal: true
       roles:
         - myrole

for finer-grained control ``max_fail_percentage`` can be used to abort the run after a given percentage of hosts has failed.


.. seealso::

   :doc:`playbooks`
       An introduction to playbooks
   :doc:`playbooks_best_practices`
       Best practices in playbooks
   :doc:`playbooks_conditionals`
       Conditional statements in playbooks
   :doc:`playbooks_variables`
       All about variables
   `User Mailing List <http://groups.google.com/group/ansible-devel>`_
       Have a question?  Stop by the google group!
   `irc.freenode.net <http://irc.freenode.net>`_
       #ansible IRC chat channel
Tweaking doc titles. 2013-09-29 23:10:28 +00:00			`Error Handling In Playbooks`
			`===========================`
Various docs reorg and additions 2012-05-13 15:00:02 +00:00
Add contents to various pages with more than one topic per page. 2013-12-26 19:32:01 +00:00			`.. contents:: Topics`

Explain error handling section better. 2013-10-03 02:03:15 +00:00			`Ansible normally has defaults that make sure to check the return codes of commands and modules and`
			`it fails fast -- forcing an error to be dealt with unless you decide otherwise.`
fix various typos in the documentation 2013-11-16 22:01:26 +00:00
fixed phrasing on 0 being non error 2017-01-14 01:55:19 +00:00			`Sometimes a command that returns different than 0 isn't an error. Sometimes a command might not always`
Conditionals chapter and some misc tweaks elsewhere 2013-09-29 20:47:34 +00:00			`need to report that it 'changed' the remote system. This section describes how to change`
			`the default behavior of Ansible for certain tasks so output and error handling behavior is`
			`as desired.`
Updating various doc items with 0.6 features, releasing soon, and removing references to things new in 0.4, which has been out long enough to no longer be new. 2012-08-01 02:19:04 +00:00
More exciting anchor tags, document more loop features! 2013-10-04 22:34:39 +00:00			`.. _ignoring_failed_commands:`

Added info about ignore_errors: True and added a user to the who uses page 2012-08-03 02:08:00 +00:00			`Ignoring Failed Commands`
			````````````````````````

Clear host errors (#18127) * document meta: clear_host_errors * Update playbooks_error_handling.rst Minor edit 2016-11-16 19:49:20 +00:00			`Generally playbooks will stop executing any more steps on a host that has a task fail.`
			`Sometimes, though, you want to continue on. To do so, write a task that looks like this::`
Added info about ignore_errors: True and added a user to the who uses page 2012-08-03 02:08:00 +00:00
			`- name: this will not be counted as a failure`
Update to conventional task format 2013-07-15 17:50:48 +00:00			`command: /bin/false`
Make use of yes/no booleans in playbooks At the moment Ansible prefers yes/no for module booleans, however booleans in playbooks are still using True/False, rather than yes/no. This changes modifies boolean uses in playbooks (and man pages) to favor yes/no rather than True/False. This change includes: - Adaptation of documentation and examples to favor yes/no - Modification to manpage output to favor yes/no (the docsite output already favors yes/no) 2012-12-14 10:56:53 +00:00			`ignore_errors: yes`
Added info about ignore_errors: True and added a user to the who uses page 2012-08-03 02:08:00 +00:00
clarified ignore_errors 2015-10-27 18:29:37 +00:00			`Note that the above system only governs the return value of failure of the particular task,`
Clear host errors (#18127) * document meta: clear_host_errors * Update playbooks_error_handling.rst Minor edit 2016-11-16 19:49:20 +00:00			`so if you have an undefined variable used or a syntax error, it will still raise an error that users will need to address.`
			`Note that this will not prevent failures on connection or execution issues.`
			`This feature only works when the task must be able to run and return a value of 'failed'.`

			`.. _resetting_unreachable:`

			`Resetting Unreachable Hosts`
			```````````````````````````

			`.. versionadded:: 2.2`

			`Connection failures set hosts as 'UNREACHABLE', which will remove them from the list of active hosts for the run.`
			To recover from these issues you can use `meta: clear_host_errors` to have all currently flagged hosts reactivated,
			`so subsequent tasks can try to use them again.`
clarified ignore_errors 2015-10-27 18:29:37 +00:00
Some minor docs corrections. 2014-02-15 19:05:42 +00:00
Fix --force-handlers, and allow it in plays and ansible.cfg The --force-handlers command line argument was not correctly running handlers on hosts which had tasks that later failed. This corrects that, and also allows you to specify force_handlers in ansible.cfg or in a play. 2015-04-04 20:37:14 +00:00			`.. _handlers_and_failure:`

			`Handlers and Failure`
			````````````````````

			`When a task fails on a host, handlers which were previously notified`
			`will not be run on that host. This can lead to cases where an unrelated failure`
			`can leave a host in an unexpected state. For example, a task could update`
			`a configuration file and notify a handler to restart some service. If a`
			`task later on in the same play fails, the service will not be restarted despite`
			`the configuration change.`

			You can change this behavior with the ``--force-handlers`` command-line option,
			or by including ``force_handlers: True`` in a play, or ``force_handlers = True``
			`in ansible.cfg. When handlers are forced, they will run when notified even`
			`if a task fails on that host. (Note that certain errors could still prevent`
			`the handler from running, such as a host becoming unreachable.)`

Fix REST anchor targets 2013-10-12 15:20:56 +00:00			`.. _controlling_what_defines_failure:`
Add an example of failed_when. 2013-10-12 14:51:32 +00:00
			`Controlling What Defines Failure`
			````````````````````````````````

Fix typo 2013-10-17 14:00:58 +00:00			`Suppose the error code of a command is meaningless and to tell if there`
Add an example of failed_when. 2013-10-12 14:51:32 +00:00			`is a failure what really matters is the output of the command, for instance`
Add example task succeeding when RC is non-zero (#23698) * Add example task succeeding when RC is non-zero I added an example on how to use the return code to decide yourself what is considered a failure. This might have helped for #23679. * Use diff as example command, instead of robocopy 2017-04-18 14:17:52 +00:00			`if the string "FAILED" is in the output.`
Add an example of failed_when. 2013-10-12 14:51:32 +00:00
Removing obsolete version behavior callout notes - initial pass. (#33172) * Removing obsolete version behavior callout notes - initial pass. * Fixed example text punctuation. * Removed old version callout. * Typo fix * Updated example * Fixed awkward sentence * Fixed incorrect feature name * Reinstated current version callout 2017-11-22 04:14:27 +00:00			`Ansible provides a way to specify this behavior as follows::`
Add an example of failed_when. 2013-10-12 14:51:32 +00:00
Add example task succeeding when RC is non-zero (#23698) * Add example task succeeding when RC is non-zero I added an example on how to use the return code to decide yourself what is considered a failure. This might have helped for #23679. * Use diff as example command, instead of robocopy 2017-04-18 14:17:52 +00:00			`- name: Fail task when the command error output prints FAILED`
Add an example of failed_when. 2013-10-12 14:51:32 +00:00			`command: /usr/bin/example-command -x -y -z`
			`register: command_result`
			`failed_when: "'FAILED' in command_result.stderr"`

Add example task succeeding when RC is non-zero (#23698) * Add example task succeeding when RC is non-zero I added an example on how to use the return code to decide yourself what is considered a failure. This might have helped for #23679. * Use diff as example command, instead of robocopy 2017-04-18 14:17:52 +00:00			`or based on the return code::`

			`- name: Fail task when both files are identical`
			`raw: diff foo/file1 bar/file2`
			`register: diff_cmd`
			`failed_when: diff_cmd.rc == 0 or diff_cmd.rc >= 2`

Fix typo in playbooks_error_handling (#31636) 2017-10-13 04:08:35 +00:00			`In previous version of Ansible, this can still be accomplished as follows::`
Add an example of failed_when. 2013-10-12 14:51:32 +00:00
			`- name: this command prints FAILED when it fails`
			`command: /usr/bin/example-command -x -y -z`
			`register: command_result`
			`ignore_errors: True`

			`- name: fail the play if the previous command did not succeed`
Update example syntax in playbooks_error_handling.rst. (#35675) 2018-02-03 11:29:58 +00:00			`fail:`
			`msg: "the command failed"`
Add an example of failed_when. 2013-10-12 14:51:32 +00:00			`when: "'FAILED' in command_result.stderr"`

More exciting anchor tags, document more loop features! 2013-10-04 22:34:39 +00:00			`.. _override_the_changed_result:`

Tweaking doc titles. 2013-09-29 23:10:28 +00:00			`Overriding The Changed Result`
			`````````````````````````````
Document changed_when clause 2013-07-14 19:43:10 +00:00
Slight edit to when_changed docs 2013-07-21 14:48:22 +00:00			`When a shell/command or other module runs it will typically report`
Updated documentation. 2013-07-23 22:49:27 +00:00			`"changed" status based on whether it thinks it affected machine state.`
Slight edit to when_changed docs 2013-07-21 14:48:22 +00:00
			`Sometimes you will know, based on the return code`
			`or output that it did not make any changes, and wish to override`
			`the "changed" result such that it does not appear in report output or`
			`does not cause handlers to fire::`
Document changed_when clause 2013-07-14 19:43:10 +00:00
			`tasks:`
Slight edit to when_changed docs 2013-07-21 14:48:22 +00:00
			`- shell: /usr/bin/billybass --mode="take me to the river"`
			`register: bass_result`
			`changed_when: "bass_result.rc != 2"`

			`# this will never report 'changed' status`
			`- shell: wall 'beep'`
fix change_when example for False 2013-10-18 18:13:13 +00:00			`changed_when: False`
Document changed_when clause 2013-07-14 19:43:10 +00:00
Document any_errors_fatal and max_fail_percentage 2015-08-12 09:48:42 +00:00			`Aborting the play`
			`````````````````

			`Sometimes it's desirable to abort the entire play on failure, not just skip remaining tasks for a host.`

Small change so the YAML renders as code. The line above the last YAML code block in this document did not have the requisite :: characters, so the YAML did not render correctly in the docs. 2015-09-15 23:37:20 +00:00			The ``any_errors_fatal`` play option will mark all hosts as failed if any fails, causing an immediate abort::
Document any_errors_fatal and max_fail_percentage 2015-08-12 09:48:42 +00:00
			`- hosts: somehosts`
			`any_errors_fatal: true`
			`roles:`
			`- myrole`

			for finer-grained control ``max_fail_percentage`` can be used to abort the run after a given percentage of hosts has failed.

Various docs reorg and additions 2012-05-13 15:00:02 +00:00
Add see also sections to pages that do not have them. 2013-10-05 16:31:16 +00:00			`.. seealso::`

			:doc:`playbooks`
			`An introduction to playbooks`
			:doc:`playbooks_best_practices`
			`Best practices in playbooks`
			:doc:`playbooks_conditionals`
			`Conditional statements in playbooks`
			:doc:`playbooks_variables`
			`All about variables`
			`User Mailing List <http://groups.google.com/group/ansible-devel>`_
			`Have a question? Stop by the google group!`
			`irc.freenode.net <http://irc.freenode.net>`_
			`#ansible IRC chat channel`

Various docs reorg and additions 2012-05-13 15:00:02 +00:00