Tree - source-git/aide - CentOS Git server

source-git / aide

Files

Commit: a814047c621b8bb7fa7e579557606e189ffd7e1d
Blob Blame History Raw
<?xml version="1.0" encoding="utf-8" ?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
    "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head><title>AIDE Manual version 0.16</title>
</head>
<body>

<h1>The AIDE manual</h1>
<h2><a name="about">About this document</a></h2>
<p>
<strong>This manual is by no means complete, usable, readable, comprehensible,
or error free.</strong>
</p>
<p>
<strong>If you have any corrections, additions or constructive comments, please
report them as bugs, patches or feature requests
<a href="http://sourceforge.net/projects/aide">here</a>.
</strong></p>
<p>
This document was originally written by Rami Lehti
<a href="mailto:rammer@cs.tut.fi">&lt;rammer@cs.tut.fi&gt;</a>
with additions made by Marc Haber
<a href="mailto:mh+aide-manual@zugschlus.de">&lt;mh+aide-manual@zugschlus.de&gt;</a>
, Richard van den Berg
<a href="mailto:richard@vdberg.org">&lt;richard@vdberg.org&gt;</a>
and Hannes von Haugwitz
.
</p>

<h2><a name="toc">Table of Contents</a></h2>
<ol>
    <li><a href="#about">About this document</a></li>
    <li><a href="#toc">Table of Contents</a></li>
    <li><a href="#whatisAIDE">What is AIDE?</a></li>
    <li><a href="#compilation">Compiling AIDE</a></li>
    <li><a href="#config">Configuration</a></li>
    <li><a href="#usage">Usage</a></li>
    <li><a href="#signing">Database and config signing</a></li>
    <li><a href="#misc">Miscellaneous</a></li>
    <li><a href="#general">General guidelines for security</a></li>
</ol>


<h2><a name="whatisAIDE">What is AIDE?</a></h2>
<p>
AIDE (Advanced Intrusion Detection Environment) is an intrusion
detection program. More specifically a file integrity checker.
</p>
<p>
AIDE constructs a database of the files specified in aide.conf, AIDE's configuration file.
The AIDE database stores various file attributes including:
file type, permissions, inode number, user, group, file size, mtime and ctime,
atime, growing size, number of links and link name.
AIDE also creates a cryptographic checksum or hash of each file using
one or a combination of the following message digest algorithms: sha1,
sha256, sha512, md5, rmd160, tiger, haval, crc32 (gost and whirlpool can be
compiled in if mhash support is available).
Additionally, the attributes acl, xattr, selinux and e2fsattrs can be used when
explicitly enabled during compile time.
</p><p>
Typically, a system administrator will create an AIDE database on a
new system before it is brought onto the network. This first AIDE
database is a snapshot of the system in it's normal state and the
yardstick by which all subsequent updates and changes will be
measured. The database should contain information about key system
binaries, libraries, header files, all files
that are expected to remain the same over time. The database probably
should not contain information about files which change frequently
like log files, mail spools, proc filesystems, user's home
directories, or temporary directories.
</p><p>
After a break-in, an administrator may begin by examining the system
using system tools like ls, ps, netstat, and who ---
 the very tools most likely to be trojaned. Imagine that ls has been
doctored to not show
any file named "sniffedpackets.log" and that ps and netstat have been
rewritten to not show any information for a process named
"sniffdaemond".
Even an administrator who had previously printed out on paper the
dates and sizes of these key system files can not be certain by
comparison that they have not been modified in some way. File dates
and sizes can be manipulated, some better root-kits make this trivial.
</p><p>
While it is possible to manipulate file dates and sizes, it is much
more difficult to manipulate a single cryptographic checksum like md5,
and exponentially more difficult to manipulate each of the entire
array of checksums that AIDE supports. By rerunning AIDE after a
break-in, a system administrator can quickly identify changes to key
files and have a fairly high degree of confidence as to the accuracy
of these findings.
</p>
<p>
Unfortunately, AIDE can not provide absolute sureness about change in
files. Like any other system file, AIDE's binary and/or database can
also be altered.
</p>



<h2><a name="compilation">Compiling AIDE</a></h2>
<h3>I'm in a hurry. Bottom line about compilation.</h3>
<p>
After you have installed all the necessary software do
<code>./configure;make;make install</code> in the main AIDE
directory of the unpacked source tree. <b>You should carefully think
about the configuration and what a possible hacker can do if
he/her/they/it has root access.</b></p>

<h3>Getting all that is needed</h3>
<p>
Before you can compile AIDE you must have certain things:
</p>
<ul>
    <li><a href="http://aide.sourceforge.net">AIDE source code</a></li>
    <li>ANSI C-compiler (<a href="https://gcc.gnu.org">GCC</a> will do just fine)</li>
    <li><a href="http://flex.sourceforge.net">GNU Flex</a></li>
    <li><a href="https://www.gnu.org/software/bison/">GNU Bison</a></li>
    <li><a href="https://www.gnu.org/software/make/">GNU Make</a></li>
    <li><a href="http://pcre.org/">PCRE library</a></li>
    <li><a href="http://mhash.sourceforge.net">Mhash library</a> (highly recommended)</li>
    <li><a href="http://zlib.net/">zlib</a> (recommended)</li>
</ul>
<p>Please check to see if there are mirrors available.</p>
<p>
Once you have the source code of AIDE you should unpack it. If you
have GNU tar then the command is <code>tar zxvf
aide-&lt;VERSION_NUMBER&gt;.tar.gz</code>
</p>
<h3>Source Code Verification</h3>
<p>
It is <strong>highly recommended</strong> to verify the signature of your
downloaded source code. You can either verify the source tarball or the git
tag.
</p>
<p>
To check the supplied signature with GnuPG:
</p>
<pre>
<code>
$ gpg --verify aide-&lt;VERSION_NUMBER&gt;.tar.gz.asc
</code>
</pre>
<p>
To validate the gpg signature of the git tag:
</p>
<pre>
<code>
$ git verify-tag v&lt;VERSION_NUMBER&gt;
</code>
</pre>
<p>The current public key is published on <a href="http://aide.sourceforge.net#gpgkey">aide.sourceforge.net</a>.</p>
<p>
If you do not have that key, you can get it from one of the well known PGP key
servers.
</p>
<p>
You have to make sure that the key you install is not a faked one. You
can do this with reasonable assurance by comparing the output of
</p>
<pre>
<code>
$ gpg --fingerprint 0x&lt;KEYID&gt;
</code>
</pre>
<p>
with the fingerprint published elsewhere.
</p>
<h3>Compile-time configuration</h3>
<p>
Next you must use the configure script found in AIDE's source code
package to configure the compilation process.</p><p>
There are several options you can select to configure. You can find out
what options are available with <code>./configure --help</code>
command. Most of the time you do not need to give any options.
You can just use configure without any parameters.</p><p>

If you want to change the directory where AIDE is installed you can
use --prefix option. For example <code>./configure --prefix=/usr
</code>
</p>
<h3>Compilation and installation</h3>
<p>
The compilation is done by simply typing <code>make</code>. You can
now type <code>make install</code> to install the binary and the
manual pages. The binary however should be installed on read-only
media or in some other tamperproof place. Also the databases should
be kept somewhere where a possible intruder cannot change them.</p>


<h2><a name="config">Configuration</a></h2>
<p>
Next you have to create a configuration file. You can find
more documentation for this in aide.conf(5) manual page.
</p>
<p>
There are three types of lines in aide.conf:
</p>
<ul>
    <li>configuration lines - used to set configuration parameters and define/undefine variables</li>
    <li>(restricted) selection lines - indicate which files will be added to the database</li>
    <li>macro lines - define or undefine variables within the config file</li>
</ul>
<p>
Lines beginning with # are ignored as comments.
</p>
<p>
Here is an example configuration.</p>


<pre>
#AIDE conf

   # Here are all the things we can check - these are the default rules
   #
   #p:      permissions
   #ftype:  file type
   #i:      inode
   #n:      number of links
   #l:      link name
   #u:      user
   #g:      group
   #s:      size
   #b:      block count
   #m:      mtime
   #a:      atime
   #c:      ctime
   #S:      check for growing size
   #I:      ignore changed filename
   #md5:    md5 checksum
   #sha1:   sha1 checksum
   #sha256: sha256 checksum
   #sha512: sha512 checksum
   #rmd160: rmd160 checksum
   #tiger:  tiger checksum
   #haval:  haval checksum
   #crc32:  crc32 checksum
   #R:      p+ftupe+i+l+n+u+g+s+m+c+md5
   #L:      p+ftype+i+l+n+u+g
   #E:      Empty group
   #>:      Growing file p+ftype+l+u+g+i+n+S
   #The following are available if you have mhash support enabled:
   #gost:   gost checksum
   #whirlpool: whirlpool checksum
   #The following are available and added to the default groups R, L and >
   #only when explicitly enabled using configure:
   #acl:    access control list
   #selinux SELinux security context
   #xattrs:  extended file attributes
   #e2fsattrs: file attributes on a second extended file system

   # You can also create custom rules - my home made rule definition goes like this
   #
   MyRule = p+i+n+u+g+s+b+m+c+md5+sha1

   # Next decide what directories/files you want in the database

   /etc p+i+u+g     #check only permissions, inode, user and group for etc
   /bin MyRule      # apply the custom rule to the files in bin
   /sbin MyRule     # apply the same custom rule to the files in sbin
   /var MyRule
   !/var/log/.*     # ignore the log dir it changes too often
   !/var/spool/.*   # ignore spool dirs as they change too often
   !/var/adm/utmp$  # ignore the file /var/adm/utmp


</pre>



<p>Here we include files in /etc, /bin and /sbin. We also include
/var but ignore /var/log, /var/spool and a single file /var/adm/utmp.
</p><p>
It is generally a good idea to ignore directories that frequently
change, unless you want to read long reports.
It is good practice to exclude tmp directories, mail spools, log
directories, proc filesystems, user's home directories, web content
directories, anything that changes regularly. It is also good practice to
include all system binaries, libraries, include files, system source
files. It will also be a good idea to include directories you don't
often look in like /dev /usr/man/.*usr/. Of course you'll want to
include as many files as practical, but think about what you include.
</p>
<p>
One example: If
you have a block device whose owner is changing frequently, you can
configure aide to just
check the attributes that do not normally change (inode, number of
links, ctime).
</p>
<p>
Note that if you are referring to a single file you should add $ to
the end of the regexp. This matches to the name of the file exactly
and does not include any other files that might have the same
beginning. In the example, all filenames beginning with
/var/adm/utmp would be ignored if there were no dollar sign at the
end of the last line. An intruder could then create a
directory called /var/adm/utmp_root_kit and place all the files
he/she/they wanted there and they would be ignored by AIDE.
</p>
<h3>Special group definitions</h3>
<p>
There are several special group definitions to tweak what attributes are
printed in the report. First report_force_attrs lists those attributes
that are always printed from changed files. For example, if you say
</p>
<pre>
report_force_attrs = u+g
</pre>
<p>
and the size of a file changes, it's user and group id will also be printed
in the report. Secondly, report_ignore_added_attrs,
report_ignore_removed_attrs and report_ignore_changed_attrs define which
attributes to ignore from the report. For example, if you define
</p>
<pre>
report_ignore_changed_attrs = b
</pre>
<p>
and the size of a file changes, it's block count will not be printed in the
report, even if it did change as well.
</p>
<p>
If an attribute is both ignored and forced the attribute is not considered for
file change but printed in the final report if the file has been otherwise
changed.
</p>
<h3>Troubleshooting your config</h3>
<p>Making a config file is a lot of hard work and must be done on a case
by case bases. Don't give up simply because you don't get it right
the first time around. This section gives you a few hints on how to debug
your config.
</p>
<p>
You can use <code>aide --verbose=255</code> to generate a lot of debug
output to help you see which files get added and which are discarded.
The following section gives some more information about
AIDE's rule matching algorithm.
</p>
<h4>Understanding AIDE rule matching</h4>
<p>
Before reading this you should have basic understanding of how regular
expressions in general and Perl Compatible Regular Expressions in particular
work. There are several good books about this. Several Perl-books also have
decent explanations about this subject.
</p>
<p>
As you already know, aide has three types of selection lines:
</p>
<ul>
<li>Regular selection lines, beginning with "/".</li>
<li>Equals selection lines, beginning with "=".</li>
<li>Negative selection lines, beginning with "!".</li>
</ul>
<p>
The string following the first character is taken as a regular
expression matching to a complete filename, including the path. In a
regular selection rule, the slash is included in the regular
expression. An implicit ^ is added in front of each rule. A group
definition follows the regular expression.
</p>
<p>
When reading the configuration file, aide internally builds a tree
that roughly resembles the directory tree to be checked. Each node
corresponds to a directory, and each node has one rule list for the
associated regular selection lines, one for the associated negative
selection lines and one for the associated equals selection lines. If
there is no associated rule, the respective list may be empty.
</p>
<p>
aide tries to place a rule as far down in the tree as possible while
still assuring that it is above all files that it matches. This is
determined by the first "special" regexp character in the rule. For
example, <code>!/proc</code> would be placed in the root node,
<code>!/proc/.*</code> would be placed in the /proc node,
<code>!/var/log/syslog*</code> is placed in the /var/log node and,
finally, <code>!/home/[a-z0-9]+/.bashrc$</code> is placed in the /home
node.
</p>
<p>
The algorithm that aide uses for rule matching is described in the
following paragraphs. The pseudocode is an adaption from src/gen_list.c.
</p>
<pre>
<code>
check_node_for_match(node,filename,first_time)
    if (first_time)
            check(equals list for this node)

    check(regular list for this node)

    if (node is not the root node)
        check_node_for_match(nodes parent,filename,false)

    if (this file is about to be added)
        check(negative list for this node)

    return (info about whether this file should be added or not and how)
</code>
</pre>
<p>
When aide needs to determine whether a file found in the file system is
to be checked, it first determines the deepest possible node x to
match the current file against (that algorithm is not part of the
pseudocode above), and then calls check-node_for_match(x, filename,
true). So, the recursion starts at the deepest possible match.
</p>
<p>
As it can also be seen, equals selection lines are only checked in the
first recursion step, thus providing some kind of speed optimization
by reducing the number of necessary regular expression evaluations,
which is a quite expensive operation.
</p>
<h5>Pitfalls</h5>
<p>
There are some side-effects from this algorithm that might seem
strange at first. For example if you have the following rules:
</p>
<pre>
<code>
/ R
=/var/log/messages$ R+a
!/var/log/messages.*
</code>
</pre>
<p>
This is what you might write if you want to check /var/log/messages
but not /var/log/messages.0 and /var/log/messages.1 etc. However since
the negative selection rules are checked last and .* can match to an
empty string /var/log/messages is not added to the database. The
following is a more correct way of doing it.
</p>
<pre>
<code>
/ R
=/var/log/messages$ R+a
!/var/log/messages\.[0-9]$
</code>
</pre>
<p>
Now only messages files ending in number 0-9 are not included in the
database. Note an intruder could disguise a rootkit by creating a
directory called messages.9. If messages.9 does not already exist that
is.
</p>
<p>
Consider the following rules:
</p>
<pre>
<code>
/ n+p+l+i+u+g+s+b+m+c+md5+sha1+rmd160+haval+gost+crc32+tiger
/etc$ n+p+l+i+u+g
/etc/resolv.conf$ n+p+l+u+g
</code>
</pre>
<p>
This way, changing /etc/resolv.conf will also report /etc as having
their mtime and ctime changed, even if /etc is configured not to be
checked for mtime and ctime. The reason is that aide only uses a
deepest-match algorithm to find the tree node to search, but a
first-match algorithm inside the node. Since /etc is in the /
directory, /etc will match the rule for the root directory and ignore
the specialized /etc rule.
</p>
<p>
Rearranging the configuration like this:
</p>
<pre>
<code>
/etc/resolv.conf$ n+p+l+u+g
/etc$ n+p+l+i+u+g
/ n+p+l+i+u+g+s+b+m+c+md5+sha1+rmd160+haval+gost+crc32+tiger
</code>
</pre>
<p>
will solve the issue. It is generally a good idea to write the most
general rules last.
</p>

<h2><a name="usage">Usage</a></h2>
<p>
First you must create a database against which future checks are
performed. This should be done immediately after the operating system
and applications have been installed, before the machine is plugged
into a network. You can do this by giving the command
<code>aide --init</code>.
This creates a database that contains all of the files that you
selected in your config file. The newly created database should now be
moved to a secure location such as read-only media. You should also
place the configuration file and the AIDE binary and preferably the
manual pages and this manual on that media also. Please remember to
edit the configuration file so that the input database is read from
that read-only media. The config file should not be kept on the
target machine. The attacker could read the config file and alter it
and if he does alter it he could place his rootkit in a place that
AIDE does not check. So the read-only media should be accessible only
during the check.</p>
<p>
Now you are all set to go. You can now check the integrity of the
files. This can be done by giving the command
<code>aide --check</code>.
AIDE now reads the database and compares it to the files found on the
disk. AIDE may find changes in places that you might not expect. For
instance tty devices often change owners and permissions. You may want
to read long reports and that is up to you to decide. But most of us
do not have the time or the inclination read through tons of garbage
every day, so you should trim the config file to include only the
files and attributes of certain files that should not change. But keep
in mind that you should not ignore too much as that leaves you open
for an attack. An intruder might place his/her/its/their root kit in a
directory that you have ignored completely. One good example is
/var/spool/lp or something similar. This is the place that lp daemon
stores its temporary files. You should not ignore it completely
however. You should only ignore the format of files that you lp daemon
keeps creating. And remember to use the $-sign at the end of your
regexps. This stops someone from creating a directory that is ignored
along with its contents.</p>
<p>
Now that you have trimmed your config file you should update the
database. This can be done by:
<code>aide --update</code>
The update command also does the same thing as check but it
creates a new database. This database should now be placed on
that read-only media along with the new config file. The check, trim,
update cycle should be repeated as long as necessary. I recommend that
the config file should be reviewed once in a while. The definition of
"a while" depends on your paranoia. Some might want do it daily after
each check. Some might want to do it weekly.</p>
<p>
There is usually some drift in the databases. What I mean by drift is
that new files are created, config files of applications are edited,
tons of small changes pile up until the report becomes
unreadable. This can be avoided by updating the database once in a
while. I myself run the update every night. But, I don't replace the
input database nearly as often. The replacement of the input datbase
should always be a manual operation. This should not be automated.
</p>
<p>
There is also an alternative way of doing this. This method may be
preferable for people that have lots of machines that run aide.
You can run
<code>aide --init</code>
on all of the hosts and move the generated databases to a central host
where you compare different versions of the databases with
<code>aide --compare</code>
This has the benefit of freeing up resources on the monitored
machines.
</p>


<h2><a name="signing">Database and config signing</a></h2>
<p>
The security of AIDE can be increased by signing the configuration and/or
database. When a database is signed, and it is changed manually, AIDE will
refuse to use it. Likewise, if a configuration is signed, AIDE will not use
it until the embedded hash is updated as well.
</p>

<p>
To make use of the signing features, use these options to the configure
script:
</p>
<dl>
<dt>
  --with-confighmactype=TYPE
</dt>
<dd>
                          Hash type to use for checking config. Valid values
                          are md5 and sha1.
</dd>
<dt>
  --with-confighmackey=KEY
</dt>
<dd>
                          HMAC hash key to use for checking config. Must be a
                          base64 encoded byte stream. Maximum string length is
                          31 chars.
</dd>
<dt>
  --with-dbhmactype=TYPE
</dt>
<dd>
                          Hash type to use for checking db. Valid values are
                          md5 and sha1.
</dd>
<dt>
  --with-dbhmackey=KEY
</dt>
<dd>
                          HMAC hash key to use for checking db. Must be a base64
                          encoded byte stream. Maximum string length is 31
                          chars.
</dd>
</dl>
<p>
The base64 encoding was chosen so that the keys are not limited to printable
characters. You can use a local base64 tool
to convert the keys to the right format. Then run configure, for example:
</p>


<p><code>
./configure --with-confighmactype=sha1
-with-confighmackey="YWlkZSBhaWRlIGFpZGUgYWlkZQo=" --with-dbhmactype=sha1
--with-dbhmackey="YWlkZSBhaWRlIGFpZGUgYWlkZQo="
</code></p>

<p>
To make the presence of a valid signature mandatory, the following configure
options can be used:
</p>

<dl>
<dt>
  --enable-forced_dbmd
</dt>
<dd>
                          Forces the file/pipe database's to have checksum.
</dd>
<dt>
  --enable-forced_configmd
</dt>
<dd>
                          Forces the config to have checksum. Also disables
                          --config-check
</dd>
</dl>

<p>
It is also possible to edit the <code>config.h</code> file by hand, and
changing the values of the <code>FORCEDBMD</code> and
<code>FORCECONFIGMD</code> macros.
</p>

<p>
Creating the hash for the aide.db database is done by running <code>aide
--init</code> or <code>aide --update</code>. The hash for the aide.conf
configuration file can be obtained by running <code>aide
--config-check</code>:
</p>

<pre>
$ aide --config-check
Config checked. Use the following to patch your config file.
0a1
> @@begin_config 27GF0+oKj1CvP4tltuibhu8YGIU=
13a15
> @@end_config
</pre>

<p>
The <code>@@begin_config</code> and <code>@@end_config</code> can be added
to the aide.conf file manually, or the output of <code>aide
--config-check</code> can be directly piped into <code>patch</code>:
</p>

<pre>
$ aide --config-check | patch
can't find file to patch at input line 2
Perhaps you should have used the -p or --strip option?
The text leading up to this was:
--------------------------
|Config checked. Use the following to patch your config file.
--------------------------
File to patch: /etc/aide.conf
patching file /etc/aide.conf
</pre>

<p>
Using <code>forced_configmd</code> will make AIDE refuse to use unsigned
configuration files. This also disables the <code>--config-check</code>
option. This only makes sense if you already have a signed configuration, or
if you have an AIDE executable on another machine that can create the signed
configurations for you.
</p>


<h2><a name="misc">Miscellaneous</a></h2>
<p>
The AIDE database can be used to find the real names and places of
files that have been moved to lost+found directory by fsck.
</p>

<h2><a name="general">General guidelines for security</a></h2>
<ol>
    <li>Do not assume anything</li>
    <li>Trust no-one,nothing</li>
    <li>Nothing is secure</li>
    <li>Security is a trade-off with usability</li>
    <li>Paranoia is your friend</li>
</ol>

</body>
</html>