cmartinbaughman · June 13, 2013 14:53
diff --git a/linux-kernel-tutorial-gregkh_lxf b/linux-kernel-tutorial-gregkh_lxf
 So run off and install git on your Linux system using the package
 manager you are comfortable with (personally, I use openSUSE, and a
 simple 'zypper install git' does everything that is needed.)

 Then start by cloning the main Linux kernel repository:

  $ mkdir ~/linux
 	$ cd ~/linux
 	$ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6.git

 This will create the directory 'linux-2.6' within the linux/ directory.
 Everything we do from here out will be within that directory, so go into
 it to start with:
 	$ cd ~/linux/linux-2.6

 Now that you have the raw source code, how do you build it and install
 it on your system?  That is a much larger task, one that is beyond this
 article.  Luckily a whole book has been written on this topic, "Linux
 Kernel in a Nutshell", and can be found free online at:
 	http://www.kroah.com/lkn/
 if you don't want to purchase it.

 So go and get your kernel configured and building, and then come back
 here to figure out what to do next.


 -- Git tips

 Here are a few tips to use with git when working with the kernel source
 tree.  First off, never do your work on the same branch that Linus
 pushes to, called "master".  Create your own branch, and use that
 instead.  This ensures that any changes that are committed to Linus's
 branch upstream, will be able to be updated by you without any problems.

 To create a new branch called 'tutorial' and check it out, do the
 following:
 	$ git branch tutorial
 	$ git checkout tutorial
 That's it.  You are now in the 'tutorial' branch of your kernel
 repository, as can be seen by the following command:
 	$ git branch
 	   master
 	 * tutorial
 The '*' in front of the 'tutorial' name shows that you are on the
 correct branch.

 Now, let's go and make some changes to the kernel code.

 -- What to change

 Wait, you don't know what change you want to make to the Linux kernel
 source tree?  Everything is working just fine for you?  Well, don't
 despair, the Linux kernel developers need all the help they can get, and
 have plenty of code in the tree that is just waiting to get cleaned up.

 The code in the drivers/staging/ tree consists of a lot of drivers that
 do not meet the normal Linux kernel coding guidelines.  The code is in
 that location so that other developers can help on cleaning it up, and
 getting it merged into the main portion of the Linux kernel tree.

 Every driver in the drivers/staging directory contains a TODO file that
 lists the things that need to be done on it in order for the code to be
 moved to the proper location in the kernel tree.  The majority of the
 drivers all contain the following line in their TODO file:
 	- fix checkpatch.pl issues

 Let's look into what this means and how you can help out with this task.

 -- Coding Style

 Every large body of code needs to have a set of coding style rules in
 order for it to be a viable project that a large number of developers
 can work on.  Numerous research studies have been made on this topic,
 and they all conclude that having a common guideline makes a very large
 difference.

 	It is not merely a matter of aesthetics that programs
 	should be written in a particular style.  Rather there
 	is a psychological basis for writing programs in a
 	conventional manner: programmers have strong expectations
 	that other programmers will follow these discourse rules.
 	If the rules are violated, then the utility afforded by
 	the expectations that programmers have built up over time
 	is effectively nullified.”
                              – Soloway & Ehrlich

 What this means is that once programmers get used to a common style, the
 patterns of the code go away when it is looked at, and the meaning shows
 through very easily.

 The goal of any Linux kernel developer is to have other developers help
 find problems in their code, and by keeping all of the code in the same
 format, it makes it much easier for anyone else to pick it up, modify
 it, or notice bugs in it.  As every line of kernel code is reviewed by
 at least 2 developers before it is accepted, having a common style
 guideline is a very important thing.

 The Linux kernel coding style can be found in the file
 Documentation/CodingStyle in the kernel source tree.  The important
 thing to remember when reading it, is not that this style is somehow
 better than any other style, just that it is consistent.

 In order to help developers easily find coding style issues, the script
 scripts/checkpatch.pl in the kernel source tree has been developed.
 This script can point out problems very easily, and should always be run
 by a developer on their changes, instead of having a reviewer waste
 their time by pointing out problems later on.

 The drivers in the drivers/staging/ directory all usually have coding
 style issues as they were developed by people not familiar with the
 Linux kernel guidelines.  One of the first things that needs to be done
 to the code, is to fix it up to follow the correct rules.

 And this is where you come in, by running the checkpatch.pl tool, you
 can find a large number of problems that need to be fixed.

 -- Specific rules

 Let us look at some of the common rules that are part of the kernel
 guidelines.

 --- Whitespace

 The first rule that everyone needs to follow is to use the 'tab'
 character, and not use spaces, to indent code.  Also, the 'tab'
 character should represent 8 spaces.  Following along with the 8
 character tab indentation, the code should not flow past the 80
 character line limit on the right side of the screen.

 Note, numerous developers have complained about the 80 character limit
 recently, and there are some places where it is acceptable to go beyond
 that limit.  If you find that you are being forced to do strange
 line-wrapping formatting just to fit into the 80 character limit, with
 all of your code on the right hand side of the screen, it is better to
 refactor the logic to prevent this from happening in the first place.
 Forcing an 80 character limit, also forces developers to break their
 logic up into smaller, easier to understand chunks, which makes it
 easier to review and follow as well.

 So yes, there is a method to the madness of the 80 character limit.

 --- Braces

 Opening braces should be placed on the same line of the statement they
 are being used for, with one exception as show below.  Closing braces
 should be placed back at the original indentation.  This can be shown
 with the following example:

 	if (error != -ENODEV) {
 	        foo();
 	        bar();
 	}

 If you need to add an else statement to an if statement, put it on the
 same line as the closing brace, as shown here:

 	if (error != -ENODEV) {
 	        foo();
 	        bar();
 	} else {
 		report_error();
 		goto exit;
 	}

 If braces are not needed for a statement, do not put them in, as they
 are unnecessary:

 	if (error != -ENODEV)
 	        foo();
 	else
 		goto exit;

 The one exception for opening braces, is for function declarations,
 those go on a new line as shown here:

 	int function(int *baz)
 	{
 	        do_something(baz);
 	        return 0;
 	}

 -- checkpatch.pl

 With these simple whitespace and brace rules now understood, let us run
 the checkpatch.pl script on some code and see what it tells us:

 	$ ./scripts/checkpatch.pl --help
 	Usage: checkpatch.pl [OPTION]... [FILE]...
 	Version: 0.30

 	Options:
 	  -q, --quiet                quiet
 	  --no-tree                  run without a kernel tree
 	  --no-signoff               do not check for 'Signed-off-by' line
 	  --patch                    treat FILE as patchfile (default)
 	  --emacs                    emacs compile window format
 	  --terse                    one line per report
 	  -f, --file                 treat FILE as regular source file
 	  --subjective, --strict     enable more subjective tests
 	  --root=PATH                PATH to the kernel tree root
 	  --no-summary               suppress the per-file summary
 	  --mailback                 only produce a report in case of warnings/errors
 	  --summary-file             include the filename in summary
 	  --debug KEY=[0|1]          turn on/off debugging of KEY, where KEY is one of
 				     'values', 'possible', 'type', and 'attr' (default
 				     is all off)
 	  --test-only=WORD           report only warnings/errors containing WORD
 				     literally
 	  -h, --help, --version      display this help and exit

 	When FILE is - read standard input.

 Some common options that we will be using is the --terse and --file
 options, as those allow us to see the problems in a much simpler report,
 and they work on an entire file, not just a single patch.

 So, let's pick a file in the kernel and see what checkpatch.pl tells us
 about it:

 	$ ./scripts/checkpatch.pl --file --terse drivers/staging/comedi/drivers/ni_labpc.c
 	drivers/staging/comedi/drivers/ni_labpc.c:4: WARNING: line over 80 characters
 	...
 	drivers/staging/comedi/drivers/ni_labpc.c:486: WARNING: braces {} are not necessary for single statement blocks
 	drivers/staging/comedi/drivers/ni_labpc.c:489: WARNING: braces {} are not necessary for single statement blocks
 	...
 	drivers/staging/comedi/drivers/ni_labpc.c:587: WARNING: suspect code indent for conditional statements (8, 0)
 	...
 	drivers/staging/comedi/drivers/ni_labpc.c:743: WARNING: printk() should include KERN_ facility level
 	drivers/staging/comedi/drivers/ni_labpc.c:750: WARNING: kfree(NULL) is safe this check is probably not required
 	...
 	drivers/staging/comedi/drivers/ni_labpc.c:2028: WARNING: EXPORT_SYMBOL(foo); should immediately follow its function/variable
 	total: 0 errors, 76 warnings, 2028 lines checked


 I've removed a lot of the warnings from the above output, as there was a
 total of 76 of them, and they are all variants of the above ones.

 As can be seen, the checkpatch.pl tool points out where the code has
 gone beyond the 80 character limit, and where braces were used that they
 were not needed, as well as a few other things that should be cleaned up
 in the file.

 Now that we know what needs to be done, fire up your favorite editor,
 and let us fix something.  How about the brace warning, that should be
 simple to resolve.

 Looking at the original code, lines 486-490 look like the following:

        if (irq) {
                printk(", irq %u", irq);
        }
        if (dma_chan) {
                printk(", dma %u", dma_chan);
        }

 A simple removal of those extra braces results in:
        if (irq)
                printk(", irq %u", irq);
        if (dma_chan)
                printk(", dma %u", dma_chan);

 Save the file, and run the checkpatch tool again to verify that the
 warning is gone:
 	$ ./scripts/checkpatch.pl --file --terse drivers/staging/comedi/drivers/ni_labpc.c | grep 486
 	$

 And of course build the file to verify that you did not break anything:
 	$ make drivers/staging/comedi/drivers/ni_labpc.o
 	  CHK     include/linux/version.h
 	  CHK     include/generated/utsrelease.h
 	  CALL    scripts/checksyscalls.sh
 	  CC [M]  drivers/staging/comedi/drivers/ni_labpc.o

 Yes, it still builds, so all is good.

 Great, you have now made your first kernel code fix!

 But, how do you take this change, and get it to the kernel developers in
 the format that they can apply it?

 -- More git fun

 As you edited this file within a git repository, your change to it is
 caught by git.  This can be seen by running the 'git status' command:
 	$ git status
 	# On branch tutorial
 	# Changed but not updated:
 	#   (use "git add <file>..." to update what will be committed)
 	#   (use "git checkout -- <file>..." to discard changes in working directory)
 	#
 	#	modified:   drivers/staging/comedi/drivers/ni_labpc.c
 	#
 	no changes added to commit (use "git add" and/or "git commit -a")

 This output shows that we are on the branch called 'tutorial', and that
 we have one file modified at the moment, the ni_labpc.c file.

 If we ask for git to show what we changed, we will see the actual lines:

 	$ git diff
 	diff --git a/drivers/staging/comedi/drivers/ni_labpc.c b/drivers/staging/comedi/drivers/ni_labpc.c
 	index dc3f398..a01e35d 100644
 	--- a/drivers/staging/comedi/drivers/ni_labpc.c
 	+++ b/drivers/staging/comedi/drivers/ni_labpc.c
 	@@ -483,12 +483,10 @@ int labpc_common_attach(struct comedi_device *dev, unsigned long iobase,
 	 
 	        printk("comedi%d: ni_labpc: %s, io 0x%lx", dev->minor, thisboard->name,
 	               iobase);
 	-       if (irq) {
 	+       if (irq)
 	                printk(", irq %u", irq);
 	-       }
 	-       if (dma_chan) {
 	+       if (dma_chan)
 	                printk(", dma %u", dma_chan);
 	-       }
 	        printk("\n");
 	 
 	        if (iobase == 0) {

 This output is in the format that the tool 'patch' can use to apply a
 change to a body of code.  The leading '-' and '+' on some lines show
 what lines are removed, and what lines are added.  Reading these diff
 outputs soon becomes natural, and is the format in which you need to
 send to the kernel maintainer to get the change accepted.

 --- Description, description, description

 The raw diff output shows what code is changed, but for every kernel
 patch, more information needs to be provided in order for it to be
 accepted.  This "metadata" is as important as the code changes, as it is
 used to show who made the change, why the change was made, and who
 reviewed the change.

 Here is a sample change that was accepted into the Linux kernel tree a
 while ago:

 	USB: otg: Fix bug on remove path without transceiver

 	In the case where a gadget driver is removed while no
 	transceiver was found at probe time, a bug in
 	otg_put_transceiver() will trigger.

 	Signed-off-by: Robert Jarzmik <robert.jarzmik@free.fr>
 	Acked-by: David Brownell <dbrownell@users.sourceforge.net>
 	Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>

 	--- a/drivers/usb/otg/otg.c
 	+++ b/drivers/usb/otg/otg.c
 	@@ -43,7 +43,8 @@ EXPORT_SYMBOL(otg_get_transceiver);
 	  void otg_put_transceiver(struct otg_transceiver *x)
 	  {
 	-        put_device(x->dev);
 	+        if (x)
 	+                put_device(x->dev);
 	  }


 The first line of the change, is a one line summary of what part of the
 kernel the change is for, and very briefly, what it does:
 	USB: otg: Fix bug on remove path without transceiver

 Then comes a more descriptive paragraph that explains why the change is
 needed:
 	In the case where a gadget driver is removed while no
 	transceiver was found at probe time, a bug in
 	otg_put_transceiver() will trigger.

 After that, comes a few lines that show who made and reviewed the patch:
 	Signed-off-by: Robert Jarzmik <robert.jarzmik@free.fr>
 	Acked-by: David Brownell <dbrownell@users.sourceforge.net>
 	Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>

 The term "Signed-off-by:" refers to the ability for the developer to
 properly claim that they are allowed to make this change, and offer it
 up under the acceptable license to be able for it to be added to the
 Linux kernel source tree.  This agreement is called the "Developer's
 Certificate of Origin" and can be found in full in the file,
 Documentation/SubmittingPatches in the Linux kernel source tree.

 A brief summary of what the Developer's Certificate of Origin consists
 of, is the following:

 	(a) I created this change; or
 	(b) Based this on a previous work with a
 	      compatible license; or
 	(c) Provided to me by (a), (b), or (c) and not
 	     modified
 	(d) This contribution is public.

 It is a very simple to understand agreement, and ensures that everyone
 involved knows that the change is legally acceptable.

 Every developer who the patch goes through, adds their "Signed-off-by:"
 to it as the patch flows through the developer and maintainer chain
 before it is accepted into the Linux kernel source tree.  This ensures
 that every line of code in the Linux kernel, can be tracked back to the
 developer who created it, and the developers who reviewed it.


 -- Creating our patch

 Now that we know how a patch is structured, we can create ours.

 First, tell git to check in the change that we made:
 	$ git commit drivers/staging/comedi/drivers/ni_labpc.c

 git will fire up your favorite editor and place you in it, with the
 following information already present:

 	# Please enter the commit message for your changes. Lines starting
 	# with '#' will be ignored, and an empty message aborts the commit.
 	# Explicit paths specified without -i nor -o; assuming --only paths...
 	# On branch tutorial
 	# Changes to be committed:
 	#   (use "git reset HEAD <file>..." to unstage)
 	#
 	#       modified:   drivers/staging/comedi/drivers/ni_labpc.c

 Create a summary line for the patch:
 	Staging: comedi: fix brace coding style issue in ni_labpc.c

 And then a more descriptive paragraph:

 	This is a patch to the ni_labpc.c file that fixes up a brace
 	warning found by the checkpatch.pl tool

 Then add your Signed-off-by: line:

 	Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>

 Then save the file and git will make the commit, printing out the
 following:
 	[tutorial 60de825] Staging: comedi: fix brace coding style issue in ni_labpc.c
 	 1 files changed, 2 insertions(+), 4 deletions(-)

 If you use the command 'git show HEAD' to see the most recent change, it
 will show you the full commit you made:
 	$ git show HEAD

 	commit 60de825964d99dee56108ce4c985a7cfc984e402
 	Author: Greg Kroah-Hartman <gregkh@suse.de>
 	Date:   Sat Jan 9 12:07:40 2010 -0800

 	    Staging: comedi: fix brace coding style issue in ni_labpc.c

 	    This is a patch to the ni_labpc.c file that fixes up a brace
 	    warning found by the checkpatch.pl tool

 	    Signed-off-by: My Name <my_name@my_email_domain>

 	diff --git a/drivers/staging/comedi/drivers/ni_labpc.c b/drivers/staging/comedi/drivers/ni_labpc.c
 	index dc3f398..a01e35d 100644
 	--- a/drivers/staging/comedi/drivers/ni_labpc.c
 	+++ b/drivers/staging/comedi/drivers/ni_labpc.c
 	@@ -483,12 +483,10 @@ int labpc_common_attach(struct comedi_device *dev, unsigned long iobase,
 	 
 		printk("comedi%d: ni_labpc: %s, io 0x%lx", dev->minor, thisboard->name,
 		       iobase);
 	-       if (irq) {
 	+       if (irq)
 			printk(", irq %u", irq);
 	-       }
 	-       if (dma_chan) {
 	+       if (dma_chan)
 			printk(", dma %u", dma_chan);
 	-       }
 		printk("\n");
 	 
 		if (iobase == 0) {


 You are now finished creating your first kernel patch!

 -- Getting your change into the kernel tree

 Now that you have created the patch, how do you get it into the kernel
 tree?  Linux kernel development primarily still happens through email,
 with patches being sent through email, and review happening that way.

 First off, let's export our patch in a format that we can use to email
 it to the maintainer who will be responsible for accepting our patch.

 To do that, once again, git has a command 'format-patch' that you can
 use:
 	$ git format-patch master..tutorial
 	0001-Staging-comedi-fix-brace-coding-style-issue-in-ni_la.patch

 In this command, we are creating all patches that exist in the
 difference from the branch 'master' (which is Linus's branch, remember
 way back at the beginning?) and our private branch, called 'tutorial'.
 This consists of only one change, our patch.  It is now in the file
 0001-Staging-comedi-fix-brace-coding-style-issue-in-ni_la.patch in our
 directory in a format that we can send off.

 Before we attempt to send the patch off, we should verify that our patch
 is in the correct format, and does not add any errors to the kernel tree
 as far as coding style issues go.  To do that, we use the checkpatch.pl
 script again:
 	$ ./scripts/checkpatch.pl 0001-Staging-comedi-fix-brace-coding-style-issue-in-ni_la.patch
 	total: 0 errors, 0 warnings, 14 lines checked

 	0001-Staging-comedi-fix-brace-coding-style-issue-in-ni_la.patch has no obvious style problems and is ready for submission.

 All is good, so it is safe to submit this change.

 But, who do we send it to?  Once again, the kernel developers have made
 this very simple, with a script that will tell you who needs to be
 notified.  This script is called, 'get_maintainer.pl', and is also in
 the scripts/ subdirectory in the kernel source tree.  This script looks
 at the files you have modified in the patch, and matches it up with the
 information in the MAINTAINERS file in the kernel source tree that
 describes who is responsible for what portion of the kernel, as well as
 looking at the past history of the files being modified, in order to
 come up with the names and email addresses of the people, and mailing
 lists, that should be notified of this patch.

 Running this script on our patch, results in the following:
 	$ ./scripts/get_maintainer.pl 0001-Staging-comedi-fix-brace-coding-style-issue-in-ni_la.patch
 	Greg Kroah-Hartman <gregkh@suse.de>
 	Bill Pemberton <wfp5p@virginia.edu>
 	devel@driverdev.osuosl.org
 	linux-kernel@vger.kernel.org

 These are the addresses we need to send the patch to.

 -- Sending an email

 So, we should just bring up our favorite email client and send the patch
 off to the list of addresses that get_maintainer.pl told us about,
 right?

 Wait, not so fast.  Almost all common email clients do nasty things with
 patch files, wrapping lines when they should not be wrapped, changing
 tabs into spaces, eating spaces when they shouldn't, and all sorts of
 other nasty things (can you say base64-encoded attachments?)  Also, some
 email servers are known for mangling patches even if you happen to send
 the patch correctly.  Exchange, Groupwise, and Lotus Notes have this
 problem, so much so that most Linux kernel development teams at
 companies that use these servers have been forced to set up a Linux
 email server somewhere just to get patches out to the community in the
 proper way.

 For details about all of these common problems, and how to properly
 configure a large number of email clients, take a look at the file,
 Documentation/email-clients.txt in the kernel source tree.  It will help
 you out if you want to use your normal email client to send patches.

 If after reading the email-clients.txt file, your email client still
 does not work properly, git can again come to your rescue.

 Git has a way to send patches created with 'git format-patch' out
 through email to the developers who need it.  The 'git send-email
 command handles this all for us:
 	$ git send-email --to gregkh@suse.de --to wfp5p@virginia.edu \
 	   --cc devel@driverdev.osuosl.org \
 	   --cc linux-kernel@vger.kernel.org \
 	   0001-Staging-comedi-fix-brace-coding-style-issue-in-ni_la.patch
 will send the patch we created to the proper developers and CC: the
 proper mailing lists.

 For details on how to configure 'git send-email' to work with your SMTP
 server, or firewall, or anything else, see the man page:
 	$ git send-email --help

 -- Now what?

 Now that you have successfully created a patch and sent it off, what
 next?  The developer who you sent it to should respond by email in a few
 days with either a nice, "thanks for the patch, I have applied it." or
 possibly some comments for changes that you should make in order to get
 it accepted.  If you have not heard anything within a week, send it
 again, don't be worried about being annoying, persistence is the key to
 getting a busy kernel subsystem maintainer's attention.

 So there you have it, the simple steps involved in creating, committing,
 and sending off a Linux kernel patch.  Hopefully this means that
 everyone reading this article will soon send in their own kernel patch,
 and after having fun doing that, continuing to contribute to the largest
 software project in the history of computing.
No results found