This is a valid Atom 1.0 feed.
This feed is valid, but interoperability with the widest range of feed readers could be improved by implementing the following recommendations.
<link href="http://rkj.github.com//atom.xml" rel="self"/>
^
line 63, column 0: (21 occurrences) [help]
<figure class="highlight"><pre><code class="la ...
<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
<title>rkj tech blog</title>
<link href="http://rkj.github.com//atom.xml" rel="self"/>
<updated>2017-03-22T23:18:18+00:00</updated>
<id>http://rkj.github.com/</id>
<author>
<name>Roman Kamyk</name>
<email>roman.kamyk@gmail.com</email>
</author>
<entry>
<title>Parsing huge OpenStreetMap's JOSM file using Nokogiri</title>
<link href="http://rkj.github.com//devcamp/josm/openstreetmap/ruby/parser/tutorial/2011/07/05/Parsing-OpenStreetMap-JOSM-data.html"/>
<updated>2011-07-05T00:00:00+00:00</updated>
<id>http://rkj.github.com//devcamp/josm/openstreetmap/ruby/parser/tutorial/2011/07/05/Parsing-OpenStreetMap-JOSM-data</id>
<content type="html"><h1 id="background">Background</h1>
<p>While attending <a href="/event/ssjs/devmeetigs/2011/07/04/DevCamp.html">SSJS Devcamp</a>
I was having fun with <a href="http://www.mongodb.org">MongoDB</a> and
<a href="http://couchdb.apache.org/">CouchDB</a>. To make some useful tests we had first to gather some data. The plan was to import data into MySQL, next migrate it
to NoSQL dbs and then make some tests/comparisons. After a quick brainstorm we
decided to create database with places/POIs. The greatest free geo repository I
know is <a href="http://openstreetmap.org">OpenStreetMap</a> and they are providing all
theirs data for <a href="http://wiki.openstreetmap.org/wiki/Downloading_data">download</a>.
Whole repository is really huge (almost 17GiB of compressed data), so we
settled for using only Poland data from a <a href="http://download.geofabrik.de/osm/europe/">nice
mirror</a>.</p>
<h1 id="data">Data</h1>
<p>Format specification is described on <a href="http://wiki.openstreetmap.org/wiki/JOSM_file_format">OSM
wiki</a>. It is pretty
straightforward XML and contains mostly roads, but also cities, buildings,
shops and ton of other things. This makes even Poland data pretty big (~150MiB
compressed, ~2.5GiB uncompressed XML file). For us just the POI stuff was
important so we had to go through whole file and cherry pick interesting
entries.</p>
<h1 id="parser">Parser</h1>
<p>Because of the size of the XML, using a <a href="http://en.wikipedia.org/wiki/XML#Document_Object_Model_.28DOM.29">DOM
parser</a> on a
laptop would not work because of memory usage. Obvious solution would be to use
SAX parser, but I consider it to be totally backwards<sup id="fnref:1"><a href="#fn:1" class="footnote">1</a></sup>. I prefer <a href="http://www.xmlpull.org/">Pull
Parsing</a> so I searched for some nice parser,
preferably in Ruby. As it turns out, the excellent
<a href="http://nokogiri.org">Nokogiri</a> have one as
<a href="http://nokogiri.org/Nokogiri/XML/Reader.html">XML::Reader</a>.</p>
<h1 id="code">Code</h1>
<p>The import was broken into two phases. In phase one the interesting data was
taken from the XML and saved as JSON (it could be any other format of course).
In the second phase it was read and inserted into MySQL. It allowed writing
phase two code while phase one was running and was generally more error
resistant (errors in phase two didn’t cause whole import to be rerun).</p>
<p>Phase one code from <a href="https://github.com/rkj/devcamp-ssjs-db/blob/master/osm/parse-osm.rb">GitHub/parse-osm.rb</a>:</p>
<figure class="highlight"><pre><code class="language-ruby" data-lang="ruby"><table style="border-spacing: 0"><tbody><tr><td class="gutter gl" style="text-align: right"><pre class="lineno">1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93</pre></td><td class="code"><pre><span class="c1">#!/usr/bin/env ruby</span>
<span class="c1"># Small script for scraping POIs from JOSM (http://wiki.openstreetmap.org/wiki/JOSM_file_format).</span>
<span class="nb">require</span> <span class="s1">'rubygems'</span>
<span class="nb">require</span> <span class="s1">'bundler/setup'</span>
<span class="nb">require</span> <span class="s1">'nokogiri'</span>
<span class="nb">require</span> <span class="s1">'json'</span>
<span class="nb">require</span> <span class="s1">'ap'</span>
<span class="k">class</span> <span class="nc">Parser</span>
<span class="c1"># Tags in data that we ignore.</span>
<span class="no">IGNORED_TAGS</span> <span class="o">=</span> <span class="p">[</span><span class="s2">"created_by"</span><span class="p">,</span> <span class="s2">"source"</span><span class="p">]</span>
<span class="k">def</span> <span class="nf">initialize</span>
<span class="c1"># Map storing attribute name mapped on count of nodes containing it.</span>
<span class="c1"># It is helpful to see what tags should be taken into account in the first place</span>
<span class="c1"># during importing.</span>
<span class="vi">@popular_attributes</span> <span class="o">=</span> <span class="p">{}</span>
<span class="c1"># number of nodes parsed</span>
<span class="vi">@count</span> <span class="o">=</span> <span class="mi">0</span>
<span class="c1"># number of total xml nodes went through</span>
<span class="vi">@total_count</span> <span class="o">=</span> <span class="mi">0</span>
<span class="c1"># number of entries considered useful as POI</span>
<span class="vi">@included</span> <span class="o">=</span> <span class="mi">0</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">parse</span><span class="p">(</span><span class="n">input</span><span class="p">,</span> <span class="n">output</span><span class="p">)</span>
<span class="n">out</span> <span class="o">=</span> <span class="no">File</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">output</span><span class="p">,</span> <span class="s2">"w"</span><span class="p">)</span>
<span class="k">begin</span>
<span class="n">out</span><span class="p">.</span><span class="nf">write</span> <span class="s2">"[</span><span class="se">\n</span><span class="s2">"</span>
<span class="c1"># Nokogiri reader created.</span>
<span class="n">reader</span> <span class="o">=</span> <span class="no">Nokogiri</span><span class="o">::</span><span class="no">XML</span><span class="o">::</span><span class="no">Reader</span><span class="p">(</span><span class="no">File</span><span class="p">.</span><span class="nf">new</span><span class="p">(</span><span class="n">input</span><span class="p">))</span>
<span class="k">while</span> <span class="n">reader</span> <span class="o">=</span> <span class="n">parse_node</span><span class="p">(</span><span class="n">reader</span><span class="p">,</span> <span class="n">out</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">ensure</span>
<span class="n">out</span><span class="p">.</span><span class="nf">write</span> <span class="s2">"{}</span><span class="se">\n</span><span class="s2">]</span><span class="se">\n</span><span class="s2">"</span>
<span class="n">out</span><span class="p">.</span><span class="nf">close</span>
<span class="n">ap</span> <span class="vi">@popular_attributes</span><span class="p">.</span><span class="nf">sort_by</span> <span class="p">{</span> <span class="o">|</span><span class="n">k</span><span class="p">,</span> <span class="n">v</span><span class="o">|</span> <span class="n">v</span><span class="p">}.</span><span class="nf">reverse</span>
<span class="no">STDERR</span><span class="p">.</span><span class="nf">puts</span> <span class="s2">""</span>
<span class="nb">puts</span> <span class="s2">"</span><span class="se">\n</span><span class="si">#{</span><span class="vi">@included</span><span class="si">}</span><span class="s2"> / </span><span class="si">#{</span><span class="vi">@count</span><span class="si">}</span><span class="s2"> / </span><span class="si">#{</span><span class="vi">@total_count</span><span class="si">}</span><span class="se">\t</span><span class="s2">"</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">def</span> <span class="nf">parse_node</span><span class="p">(</span><span class="n">r</span><span class="p">,</span> <span class="n">out</span><span class="p">)</span>
<span class="c1"># Search for 'node' tags because they contain data (points). Other tags</span>
<span class="c1"># are discarder.</span>
<span class="p">(</span><span class="n">r</span> <span class="o">=</span> <span class="n">r</span><span class="p">.</span><span class="nf">read</span><span class="p">;</span> <span class="n">progress</span><span class="p">)</span> <span class="k">while</span> <span class="n">r</span> <span class="o">&amp;&amp;</span> <span class="n">r</span><span class="p">.</span><span class="nf">name</span> <span class="o">!=</span> <span class="s1">'node'</span>
<span class="c1"># Stop processing if end of file</span>
<span class="k">return</span> <span class="kp">false</span> <span class="k">unless</span> <span class="n">r</span>
<span class="c1"># Create entry to be enriched with 'tag' data</span>
<span class="n">entry</span> <span class="o">=</span> <span class="p">{</span> <span class="ss">:lat</span> <span class="o">=&gt;</span> <span class="n">r</span><span class="p">.</span><span class="nf">attribute</span><span class="p">(</span><span class="s2">"lat"</span><span class="p">),</span> <span class="ss">:lon</span> <span class="o">=&gt;</span> <span class="n">r</span><span class="p">.</span><span class="nf">attribute</span><span class="p">(</span><span class="s2">"lon"</span><span class="p">)</span> <span class="p">}</span>
<span class="c1"># Required fields to create usable POI.</span>
<span class="n">req</span> <span class="o">=</span> <span class="p">[</span><span class="s2">"name"</span><span class="p">]</span>
<span class="k">while</span> <span class="p">(</span><span class="n">progress</span><span class="p">;</span> <span class="n">r</span> <span class="o">=</span> <span class="n">r</span><span class="p">.</span><span class="nf">read</span><span class="p">)</span>
<span class="c1"># Next node found, so no more tags.</span>
<span class="k">break</span> <span class="k">if</span> <span class="n">r</span><span class="p">.</span><span class="nf">name</span> <span class="o">==</span> <span class="s1">'node'</span>
<span class="c1"># Only 'tag' are interesting.</span>
<span class="k">next</span> <span class="k">unless</span> <span class="n">r</span><span class="p">.</span><span class="nf">name</span> <span class="o">==</span> <span class="s1">'tag'</span>
<span class="c1"># Each tag has form of &lt;tag k="key" v="value" /&gt;</span>
<span class="n">key</span> <span class="o">=</span> <span class="n">r</span><span class="p">.</span><span class="nf">attribute</span> <span class="s2">"k"</span>
<span class="k">unless</span> <span class="no">IGNORED_TAGS</span><span class="p">.</span><span class="nf">include?</span> <span class="n">key</span>
<span class="n">req</span><span class="p">.</span><span class="nf">delete</span> <span class="n">key</span>
<span class="n">entry</span><span class="p">[</span><span class="n">key</span><span class="p">]</span> <span class="o">=</span> <span class="n">r</span><span class="p">.</span><span class="nf">attribute</span> <span class="s2">"v"</span>
<span class="vi">@popular_attributes</span><span class="p">[</span><span class="n">key</span><span class="p">]</span> <span class="o">||=</span> <span class="mi">0</span>
<span class="vi">@popular_attributes</span><span class="p">[</span><span class="n">key</span><span class="p">]</span> <span class="o">+=</span> <span class="mi">1</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="c1"># If all required tags were found.</span>
<span class="k">if</span> <span class="n">req</span><span class="p">.</span><span class="nf">size</span> <span class="o">==</span> <span class="mi">0</span>
<span class="vi">@included</span> <span class="o">+=</span> <span class="mi">1</span>
<span class="n">out</span><span class="p">.</span><span class="nf">write</span><span class="p">(</span><span class="n">entry</span><span class="p">.</span><span class="nf">to_json</span><span class="p">)</span>
<span class="n">out</span><span class="p">.</span><span class="nf">write</span><span class="p">(</span><span class="s2">",</span><span class="se">\n</span><span class="s2">"</span><span class="p">)</span>
<span class="k">end</span>
<span class="n">progress</span><span class="p">(</span><span class="kp">true</span><span class="p">)</span>
<span class="k">return</span> <span class="n">r</span>
<span class="k">end</span>
<span class="c1"># Progress info</span>
<span class="k">def</span> <span class="nf">progress</span><span class="p">(</span><span class="n">entry_found</span> <span class="o">=</span> <span class="kp">false</span><span class="p">)</span>
<span class="vi">@total_count</span> <span class="o">+=</span> <span class="mi">1</span><span class="p">;</span>
<span class="vi">@count</span> <span class="o">+=</span> <span class="mi">1</span> <span class="k">if</span> <span class="n">entry_found</span>
<span class="n">limit</span> <span class="o">=</span> <span class="mi">10000</span>
<span class="k">if</span> <span class="vi">@total_count</span> <span class="o">%</span> <span class="n">limit</span> <span class="o">==</span> <span class="mi">0</span>
<span class="no">STDERR</span><span class="p">.</span><span class="nf">print</span> <span class="s2">"."</span>
<span class="no">STDERR</span><span class="p">.</span><span class="nf">print</span> <span class="s2">"</span><span class="se">\r</span><span class="si">#{</span><span class="vi">@included</span><span class="si">}</span><span class="s2"> / </span><span class="si">#{</span><span class="vi">@count</span><span class="si">}</span><span class="s2"> / </span><span class="si">#{</span><span class="vi">@total_count</span><span class="si">}</span><span class="se">\t</span><span class="s2">"</span> <span class="k">if</span> <span class="vi">@total_count</span> <span class="o">%</span> <span class="p">(</span><span class="n">limit</span> <span class="o">*</span> <span class="mi">50</span><span class="p">)</span> <span class="o">==</span> <span class="mi">0</span>
<span class="no">STDERR</span><span class="p">.</span><span class="nf">flush</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">if</span> <span class="no">ARGV</span><span class="p">.</span><span class="nf">size</span> <span class="o">&lt;</span> <span class="mi">2</span>
<span class="nb">puts</span> <span class="s2">"Usage: </span><span class="si">#{</span><span class="vg">$PROGRAM_NAME</span><span class="si">}</span><span class="s2"> osm_file output_json"</span>
<span class="nb">exit</span> <span class="mi">1</span>
<span class="k">end</span>
<span class="no">Parser</span><span class="p">.</span><span class="nf">new</span><span class="p">.</span><span class="nf">parse</span> <span class="no">ARGV</span><span class="p">[</span><span class="mi">0</span><span class="p">],</span> <span class="no">ARGV</span><span class="p">[</span><span class="mi">1</span><span class="p">]</span><span class="w">
</span></pre></td></tr></tbody></table></code></pre></figure>
<p>Parsing whole file ran for a little over 250 seconds:</p>
<div class="highlighter-rouge"><pre class="highlight"><code>ruby parse-osm.rb poland.osm poi.json 247.94s user 3.07s system 99% cpu 4:13.08 total
</code></pre>
</div>
<p>Just copying the file take more than half of that:</p>
<div class="highlighter-rouge"><pre class="highlight"><code>cp -i poland.osm del.me 0.03s user 3.80s system 2% cpu 2:32.33 total
</code></pre>
</div>
<p>So the overhead is not too big. While running, the script used less than 4MiB
of RAM, so also very acceptable. The result is 101 793 POI candidates[^2] that
will be imported into the DB in the phase two. It will be described in more
details in a following post, so stay tuned!</p>
<p>react appropriately. Is is just wrong. I guess this API was easy to implement,
but the result is a strange to use, and code using it ends up pretty mangled.
[^2]: Extracted from 14 122 097 JOSM ‘node’s and 89 522 705 XML nodes;that gives processing of
more than 362 440 XML tags per second.</p>
<div class="footnotes">
<ol>
<li id="fn:1">
<p>I mean literally backwards. You, as a client, has to maintain state and&nbsp;<a href="#fnref:1" class="reversefootnote">&#8617;</a></p>
</li>
</ol>
</div>
</content>
</entry>
<entry>
<title>SSJS Devcamp organized by DevMeetings.pl</title>
<link href="http://rkj.github.com//event/ssjs/devmeetigs/2011/07/04/DevCamp.html"/>
<updated>2011-07-04T00:00:00+00:00</updated>
<id>http://rkj.github.com//event/ssjs/devmeetigs/2011/07/04/DevCamp</id>
<content type="html"><p>I have always liked lightweight events, like BarCamp, and have tried to attend
most of locally organized ones. At the beginning of this year I have taken
part in <a href="http://www.jug.poznan.pl/materialy-ze-spotkan/materialy/1-poznanski-code-retreat/">Poznań Code
Retreat</a>
that was really great and intense coding experience, and I would love to take
part in another one.</p>
<p>In June I have received an invitation to a Server Side Javascript R&amp;D Camp
organized by <a href="http://devmeetings.pl">DevMeetings.pl</a>. It was advertised as a
three days long, advanced and practical developers meeting in a nice hotel in
countryside. It didn’t take me long to accept the invitation and I was eagerly
waiting for it.
<!--more--></p>
<p>About twenty developers arrived on Thursday evening. For the first two days we
worked in 4-5 people teams researching different topics: security, performance,
NoSQL databases, integration, tools and practices. I worked on db team and will
describe the results in some following post. On each day’s evening each team
presented theirs results. This introduced a little pressure and made people
work hard, as no one would like to present nothing…</p>
<p>On Sunday, our final day, each team was given the same task: implement simple
twitter clone using some of the technologies we have worked with. We started at
10am and the deadline was dinner at 2pm. I, together with
<a href="http://twitter.com/#!/bartaz">Bartek</a> and
<a href="http://twitter.com/#!/gustaff_weldon">Bernard</a> used RingoJS deployed on
AppEngine, and the results are visible
<a href="http://devcamp.ringojs-twitter.appspot.com/">here</a>. I think it is quite
impressive that now we have technologies and tools that allows us to build such
things in a rather short time span. But to be honest we could use Python
instead of RingoJS and there would be even less work to do
(<a href="http://www.appenginejs.org/">appenginejs</a> is rather incomplete and needed
some patches).</p>
<p>And again, after dinner each team showed theirs app, each was crowd beta-tested
and the results of the camp was discussed. Sadly I had to leave before
the conclusion but I have a feeling that SSJS was not voted as production-ready
Java replacement… But more on that later.</p>
<p>I would like to thank the organizers and all the great attenders, I had a great
time and learned a lot about today’s hot technologies. See you next time!</p>
</content>
</entry>
<entry>
<title>Email notification for multiple Mercurial repositories</title>
<link href="http://rkj.github.com//mercurial/notifications/2010/06/10/MercurialMultipleReposEmail.html"/>
<updated>2010-06-10T00:00:00+00:00</updated>
<id>http://rkj.github.com//mercurial/notifications/2010/06/10/MercurialMultipleReposEmail</id>
<content type="html"><h1 id="mercurial-server">Mercurial-server</h1>
<p>Recently I have discovered the awesome <a href="http://www.lshift.net/mercurial-server.html">mercurial-server</a>. It gives flexible access control, takes care of file permission (which was problematic in our previous setup) and allows to hide physical path of the repo from its URL. It also offers global configuration file, which can include hooks for all of the repositories.</p>
<p>This offers an advantage in setting mail notifications using <a href="http://mercurial.selenic.com/wiki/NotifyExtension">NotifyExtension</a>. In our previous setup we had included all the options and the template in each’s repository hgrc file. It required us to use some setup script for creating repositories and was somewhat clumsy - there were no feel of some global administration.</p>
<p>Using global configuration allow us to easily put the configuration in the global rc file (in <code class="highlighter-rouge">/etc/mercurial-server/remote-hgrc.d/</code> dir) pointing to the template inside <code class="highlighter-rouge">hgadmin</code> repo. There would still be need to put <code class="highlighter-rouge">usersubs</code> or <code class="highlighter-rouge">reposubs</code> sections in each repository’ hgrc to specify who gets notification from which repo but it is now much cleaner and only info about specific repo goes to the hgrc.</p>
<h1 id="global-subscribers-configuration">Global subscribers configuration</h1>
<p>We wanted to get one step further and have some global configuration of subscribers in one place (and preferably in <code class="highlighter-rouge">hgadmin</code> repo), but this required some extra effort. Since I have not found a way to use any variables in <code class="highlighter-rouge">reposubs</code> section (which specifies who gets notified) we decided that all emails will go to local account (<code class="highlighter-rouge">hg@localhost</code> at the server), and will include custom header (X-Repo) with the repository name (it was easily added in the style file).
Then we use <a href="http://www.procmail.org/">procmail</a> to handle routing:</p>
<figure class="highlight"><pre><code class="language-bash" data-lang="bash">:0c
<span class="k">*</span> ^X-Repo: <span class="se">\/</span>.<span class="k">*</span>
<span class="o">{</span>
<span class="nv">RPATH</span><span class="o">=</span>/repos/hgadmin/subscriptions/<span class="k">${</span><span class="nv">MATCH</span><span class="k">}</span>
<span class="nv">DEFAULT</span><span class="o">=</span><span class="sb">`</span>cat /repos/hgadmin/subscriptions/default<span class="sb">`</span>
<span class="nv">EMAIL</span><span class="o">=</span><span class="sb">`</span><span class="k">if</span> <span class="o">[</span> -f <span class="nv">$RPATH</span> <span class="o">]</span>; <span class="k">then </span>cat <span class="nv">$RPATH</span>; <span class="k">else </span><span class="nb">echo</span> <span class="s2">"Please create file /hgadmin/subscriptions/</span><span class="k">${</span><span class="nv">MATCH</span><span class="k">}</span><span class="s2">!"</span> | mail -s <span class="s2">"No subscriptions for repository &gt;&gt;</span><span class="k">${</span><span class="nv">MATCH</span><span class="k">}</span><span class="s2">&lt;&lt;"</span> <span class="nv">$DEFAULT</span>; <span class="nb">echo</span> <span class="nv">$DEFAULT</span> ; <span class="k">fi</span> <span class="sb">`</span>
:0
! <span class="k">${</span><span class="nv">EMAIL</span><span class="k">}</span>
<span class="o">}</span></code></pre></figure>
<p>If email have the <code class="highlighter-rouge">X-Repo</code> header we search for subscribers in the repository’s file (addresses can be separated by spaces or newlines). If there is no relevant file, a warning email is sent to the default address, so we do not forget about any created repo.</p>
<p>Then procmail forwards the notification email to the subscribers and we are happy. Of course one could have even more advanced setup involving reading groups or privileges from LDAP, etc.</p>
<p>But maybe you know a simpler way (without procmail) to achieve this?</p>
<p>Below you see our configuration files:</p>
<h1 id="etcmercurial-serverremote-hgrcdnotifyrc">/etc/mercurial-server/remote-hgrc.d/notify.rc</h1>
<figure class="highlight"><pre><code class="language-ini" data-lang="ini"><span class="nn">[extensions]</span>
<span class="py">hgext.notify</span> <span class="p">=</span>
<span class="nn">[hooks]</span>
<span class="py">changegroup.notify</span> <span class="p">=</span> <span class="s">python:hgext.notify.hook</span>
<span class="nn">[email]</span>
<span class="py">from</span> <span class="p">=</span> <span class="s">vcs-hg@itiner.pl</span>
<span class="nn">[smtp]</span>
<span class="py">host</span> <span class="p">=</span> <span class="s">localhost</span>
<span class="nn">[notify]</span>
<span class="py">test</span> <span class="p">=</span> <span class="s">false</span>
<span class="py">strip</span> <span class="p">=</span> <span class="s">0</span>
<span class="py">style</span> <span class="p">=</span> <span class="s">/repos/hgadmin/styles/notify.style</span>
<span class="nn">[reposubs]</span>
<span class="c"># key is glob pattern, value is comma-separated list of subscriber emails
</span><span class="err">*</span> <span class="err">=</span> <span class="err">hg@localhost</span></code></pre></figure>
<h1 id="reposhgadminstylesnotifystyle">/repos/hgadmin/styles/notify.style</h1>
<figure class="highlight"><pre><code class="language-ini" data-lang="ini"><span class="py">start_files</span> <span class="p">=</span> <span class="s">'files: '</span>
<span class="py">file</span> <span class="p">=</span> <span class="s">' {file}'</span>
<span class="py">end_files</span> <span class="p">=</span> <span class="s">'\n'</span>
<span class="py">start_file_mods</span> <span class="p">=</span> <span class="s">'files: '</span>
<span class="py">file_mod</span> <span class="p">=</span> <span class="s">' {file_mod}'</span>
<span class="py">end_file_mods</span> <span class="p">=</span> <span class="s">'\n'</span>
<span class="py">start_file_adds</span> <span class="p">=</span> <span class="s">'files+: '</span>
<span class="py">file_add</span> <span class="p">=</span> <span class="s">' {file_add}'</span>
<span class="py">end_file_adds</span> <span class="p">=</span> <span class="s">'\n'</span>
<span class="py">start_file_dels</span> <span class="p">=</span> <span class="s">'files-: '</span>
<span class="py">file_del</span> <span class="p">=</span> <span class="s">' {file_del}'</span>
<span class="py">end_file_dels</span> <span class="p">=</span> <span class="s">'\n'</span>
<span class="py">start_file_copies</span> <span class="p">=</span> <span class="s">'copies: '</span>
<span class="py">file_copy</span> <span class="p">=</span> <span class="s">' {name} ({source})'</span>
<span class="py">end_file_copies</span> <span class="p">=</span> <span class="s">'\n'</span>
<span class="py">extra</span> <span class="p">=</span> <span class="s">'extra: {key}={value|stringescape}\n'</span>
<span class="py">changeset</span><span class="p">=</span><span class="s">"X-Repo: {webroot|basename}</span><span class="se">\n</span><span class="s">Subject: [{webroot|basename}] Mercurial changeset {rev} by {author}</span><span class="se">\n\n</span><span class="s">repository: https://vcs.itiner.pl/projects/{webroot|basename}/repository</span><span class="se">\n</span><span class="s">changeset: {rev}</span><span class="se">\n</span><span class="s">user: {author}</span><span class="se">\n</span><span class="s">date: {date|date}</span><span class="se">\n</span><span class="s">{file_mods}{file_adds}{file_dels}{file_copies_switch}{extras}</span><span class="se">\n</span><span class="s">description:</span><span class="se">\n</span><span class="s">{desc}</span><span class="se">\n</span><span class="s">"</span></code></pre></figure>
</content>
</entry>
<entry>
<title>Automatically adding specific files before commit to Mercurial repo</title>
<link href="http://rkj.github.com//mercurial/2010/05/26/Mercurial-PreCommitAddFiles.html"/>
<updated>2010-05-26T00:00:00+00:00</updated>
<id>http://rkj.github.com//mercurial/2010/05/26/Mercurial-PreCommitAddFiles</id>
<content type="html"><p>When you create new files, you have to add them manually to the repository using either <code class="highlighter-rouge">hg add</code> or <code class="highlighter-rouge">hg addremove</code> before issuing commit.</p>
<p>If you often create some files you may want not to be bothered by this additional commands. Then you may use <code class="highlighter-rouge">hg commit -A</code> which will add all new files automatically. You can even make an alias in your <code class="highlighter-rouge">.hgrc</code> so every commit will have the option turned on.</p>
<p>But there might be a problem if you have some unnecessary files (like logs) that are not specified in <code class="highlighter-rouge">.hgignore</code>.
If for some reason you cannot/do not want to create ignore rules for them, but you still want to add some files automatically with commit you may try a simple hook in your <code class="highlighter-rouge">.hgrc</code>:</p>
<figure class="highlight"><pre><code class="language-ini" data-lang="ini"><span class="nn">[hooks]</span>
<span class="py">pre-commit</span> <span class="p">=</span> <span class="s">hg add . -I 'glob:**.c' -q || true</span></code></pre></figure>
<p>Where you should of course change <code class="highlighter-rouge">'glob:**.c'</code> to a pattern that will match your files.
And do not confuse <code class="highlighter-rouge">pre-commit</code> with <code class="highlighter-rouge">precommit</code> as they are totally different.</p>
<p>Personally I do not use this, because always before issuing a commit I do:</p>
<figure class="highlight"><pre><code class="language-sh" data-lang="sh"><span class="gp">$ </span>hg status
<span class="gp">$ </span>hg diff | mate </code></pre></figure>
<p>to see all the changes made and make sure that there are no leftover changes (like some random debugging printfs). But your situation may be different and you can give the hook a try.</p>
</content>
</entry>
<entry>
<title>Mercurial rollback can cause unexpected data loss</title>
<link href="http://rkj.github.com//mercurial/2010/02/18/Mercurial-rollback-dataloss.html"/>
<updated>2010-02-18T00:00:00+00:00</updated>
<id>http://rkj.github.com//mercurial/2010/02/18/Mercurial-rollback-dataloss</id>
<content type="html"><p>One of import differences between Mercurial and git is that in the former it is very hard to change the history of a repository. It is by design (although it can be ,,bypassed’’ with usage of plugins like MQ) and there are good reasons for it. This gives this warm, fuzzy feeling of security and makes harder to screw up really hard. So I consider it to be a real user-friendly aspect of Mercurial.</p>
<p>But recently I was bitten by rollback command which undo last transaction (<code class="highlighter-rouge">commit</code>, <code class="highlighter-rouge">pull</code>, and some other commands). I use it more often than I would like - mostly for undoing commit. It helps when I have forgot to add a file, or added some garbage, etc. It is suitable only before pushing changes somewhere, because otherwise they will back. But in my case, many simple mistakes are spotted right away, just after <code class="highlighter-rouge">commit</code>, and maybe <code class="highlighter-rouge">status</code>. They may be fixed without rollback - with a second commit, but it would unnecessary clobber history and make look silly. So if I make an erroneous commit I rollback, fix it and commit again. This procedure has worked fine many, many times. Until recently.</p>
<p>Let see some contrived example. First we create a repo:</p>
<figure class="highlight"><pre><code class="language-sh" data-lang="sh"><span class="gp">$ </span>mkdir example
<span class="gp">$ </span><span class="nb">cd </span>example
<span class="gp">$ </span>hg init
<span class="gp">$ </span><span class="nb">echo </span>a &gt; a
<span class="gp">$ </span>hg ci -Am <span class="s1">'a'</span></code></pre></figure>
<p>Next we start some development in a branch:</p>
<figure class="highlight"><pre><code class="language-sh" data-lang="sh"><span class="gp">$ </span>hg branch b
<span class="gp">$ </span><span class="nb">echo </span>b &gt; b
<span class="gp">$ </span>hg ci -Am <span class="s1">'b'</span></code></pre></figure>
<p>And next we make mistaken commit</p>
<figure class="highlight"><pre><code class="language-sh" data-lang="sh"><span class="gp">$ </span><span class="nb">echo </span>d &gt; c
<span class="gp">$ </span>hg ci -Am <span class="s1">'c'</span></code></pre></figure>
<p>Normally if we spot error right away we can issue:</p>
<figure class="highlight"><pre><code class="language-sh" data-lang="sh"><span class="gp">$ </span>hg rollback
<span class="gp">$ </span>cat c
d <span class="c"># ok, the file is there, we can fix it</span>
<span class="gp">$ </span>hg ci -Am <span class="s1">'c'</span></code></pre></figure>
<p>But if we change a branch and then make a rollback we are screwed:</p>
<figure class="highlight"><pre><code class="language-sh" data-lang="sh"><span class="gp">$ </span>hg up default
<span class="gp">$ </span>hg rollback
<span class="gp">$ </span>cat c
cat: c: No such file or directory <span class="c"># hmmm</span>
<span class="gp">$ </span>hg up b <span class="c"># let's go to our original branch</span>
<span class="gp">$ </span>cat c
cat: c: No such file or directory <span class="c"># still nothing :/</span></code></pre></figure>
<p>Since the commit is undone there is no <code class="highlighter-rouge">c</code> in branch <code class="highlighter-rouge">b</code>, so <code class="highlighter-rouge">update</code> does not help. Fortunately for me, I had editor windows still opened, where I could use undo - to redo my changes. This time I have not lost anything. But there were some scary moments ;-).</p>
<p>So I think the lesson is that if you are to make a rollback then be sure that you have not made any updates to the repository, because rollback will not even warn you.</p>
</content>
</entry>
<entry>
<title>Updated Rakefile for dealing with mercurial subrepos</title>
<link href="http://rkj.github.com//mercurial/rake/subrepositories/2010/02/13/Mercurial-subrepos-update.html"/>
<updated>2010-02-13T00:00:00+00:00</updated>
<id>http://rkj.github.com//mercurial/rake/subrepositories/2010/02/13/Mercurial-subrepos-update</id>
<content type="html"><p>Previously I have described my solution for subrepos in Mercurial. It was working nicely but, a little painfully, because you had to add every single Mercurial command you like to use as Rake task. After little googleing and scratching head I have come with a much nicer and general solution:</p>
<figure class="highlight"><pre><code class="language-ruby" data-lang="ruby"><span class="n">desc</span> <span class="s2">"Invokes any hg command on every repo."</span>
<span class="n">task</span> <span class="s2">"hg:command"</span>
<span class="n">rule</span> <span class="sr">/^hg:/</span> <span class="k">do</span> <span class="o">|</span><span class="n">t</span><span class="o">|</span> <span class="c1"># rule selecting task starting with hg:</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg </span><span class="si">#{</span><span class="n">t</span><span class="p">.</span><span class="nf">name</span><span class="p">.</span><span class="nf">sub</span><span class="p">(</span><span class="sr">/^hg:/</span><span class="p">,</span> <span class="s2">""</span><span class="p">)</span><span class="si">}</span><span class="s2">"</span><span class="p">)</span> <span class="c1"># 'hg:' is in task name so we have to erase it</span>
<span class="k">end</span> <span class="c1"># and that's all folks :D</span></code></pre></figure>
<p>It is shorter and any command may be used, with any parameters, like:</p>
<figure class="highlight"><pre><code class="language-ruby" data-lang="ruby"><span class="n">rake</span> <span class="n">hg</span><span class="ss">:status</span>
<span class="n">rake</span> <span class="n">hg</span><span class="ss">:"revert -a"</span></code></pre></figure>
<p>and so on. Enjoy!</p>
</content>
</entry>
<entry>
<title>Shifting bytes left in different languages</title>
<link href="http://rkj.github.com//java/ruby/python/c/programming/2010/01/19/Shift-left.html"/>
<updated>2010-01-19T00:00:00+00:00</updated>
<id>http://rkj.github.com//java/ruby/python/c/programming/2010/01/19/Shift-left</id>
<content type="html"><p>Recently I must have had a really bad day, because I wrote a very stupid line of code in java:</p>
<figure class="highlight"><pre><code class="language-java" data-lang="java"><span class="kt">int</span> <span class="n">x</span> <span class="o">=</span> <span class="mi">2</span> <span class="o">&lt;&lt;</span> <span class="o">(</span><span class="n">s</span> <span class="o">-</span> <span class="mi">1</span><span class="o">);</span></code></pre></figure>
<p>when I wanted something a little simpler:</p>
<figure class="highlight"><pre><code class="language-java" data-lang="java"><span class="kt">int</span> <span class="n">x</span> <span class="o">=</span> <span class="mi">1</span> <span class="o">&lt;&lt;</span> <span class="n">s</span><span class="o">;</span></code></pre></figure>
<p>It gave unexpected (for me) behavior for shifting negative number of bits. I was having hope that shifting -1 place left would be equal to shifting 1 place right. But I was wrong, and the result was always zero, no matter what number I would try to shift.
After a little search I have found: <a href="http://java.sun.com/docs/books/tutorial/java/nutsandbolts/op3.html" title="Bitwise and Bit Shift Operators">Bitwise and Bit Shift Operators</a> but is says nothing about this case.</p>
<p>So I tried the same code in different languages to check how they behave. I started with my favorite – Ruby:</p>
<figure class="highlight"><pre><code class="language-ruby" data-lang="ruby"><span class="k">def</span> <span class="nf">shift</span><span class="p">(</span><span class="n">no</span><span class="p">,</span> <span class="n">s</span><span class="p">)</span>
<span class="p">[</span><span class="n">no</span> <span class="o">&lt;&lt;</span> <span class="p">(</span><span class="n">s</span> <span class="o">-</span> <span class="mi">1</span><span class="p">),</span> <span class="p">(</span><span class="n">no</span> <span class="o">&lt;&lt;</span> <span class="n">s</span><span class="p">)</span> <span class="o">/</span> <span class="mi">2</span><span class="p">]</span>
<span class="k">end</span>
<span class="n">shift</span><span class="p">(</span><span class="mi">2</span><span class="p">,</span> <span class="mi">8</span><span class="p">)</span> <span class="c1"># =&gt; [256, 256]</span>
<span class="n">shift</span><span class="p">(</span><span class="mi">2</span><span class="p">,</span> <span class="mi">1</span><span class="p">)</span> <span class="c1"># =&gt; [2, 2]</span>
<span class="n">shift</span><span class="p">(</span><span class="mi">2</span><span class="p">,</span> <span class="mi">0</span><span class="p">)</span> <span class="c1"># =&gt; [1, 1]</span>
<span class="n">shift</span><span class="p">(</span><span class="mi">256</span><span class="p">,</span> <span class="o">-</span><span class="mi">4</span><span class="p">)</span> <span class="c1"># =&gt; [8, 8]</span></code></pre></figure>
<p>Here the results were as I would like them to be.</p>
<p>So what does our good, old friend C has to say?</p>
<figure class="highlight"><pre><code class="language-c" data-lang="c"><span class="cp">#include &lt;stdio.h&gt;
</span><span class="kt">void</span> <span class="nf">shift</span><span class="p">(</span><span class="kt">int</span> <span class="n">no</span><span class="p">,</span> <span class="kt">int</span> <span class="n">s</span><span class="p">)</span> <span class="p">{</span>
<span class="kt">int</span> <span class="n">one</span> <span class="o">=</span> <span class="n">no</span> <span class="o">&lt;&lt;</span> <span class="p">(</span><span class="n">s</span> <span class="o">-</span> <span class="mi">1</span><span class="p">);</span>
<span class="kt">int</span> <span class="n">two</span> <span class="o">=</span> <span class="p">(</span><span class="n">no</span> <span class="o">&lt;&lt;</span> <span class="n">s</span><span class="p">)</span> <span class="o">/</span> <span class="mi">2</span><span class="p">;</span>
<span class="n">printf</span><span class="p">(</span><span class="s">"Shift(%d, %d): %d, %d</span><span class="se">\n</span><span class="s">"</span><span class="p">,</span> <span class="n">no</span><span class="p">,</span> <span class="n">s</span><span class="p">,</span> <span class="n">one</span><span class="p">,</span> <span class="n">two</span><span class="p">);</span>
<span class="p">}</span>
<span class="kt">int</span> <span class="nf">main</span><span class="p">()</span> <span class="p">{</span>
<span class="n">shift</span><span class="p">(</span><span class="mi">2</span><span class="p">,</span> <span class="mi">8</span><span class="p">);</span> <span class="c1">// 256, 256
</span> <span class="n">shift</span><span class="p">(</span><span class="mi">2</span><span class="p">,</span> <span class="mi">1</span><span class="p">);</span> <span class="c1">// 2, 2
</span> <span class="n">shift</span><span class="p">(</span><span class="mi">2</span><span class="p">,</span> <span class="mi">0</span><span class="p">);</span> <span class="c1">// 0, 1
</span> <span class="n">shift</span><span class="p">(</span><span class="mi">256</span><span class="p">,</span> <span class="o">-</span><span class="mi">4</span><span class="p">);</span> <span class="c1">// 0, 0
</span> <span class="k">return</span> <span class="mi">0</span><span class="p">;</span>
<span class="p">}</span></code></pre></figure>
<p>It gives the same results as Java code. So I have to assume that it is described in some specification, or maybe is processor specific as shifting is simple assembler instruction.</p>
<p>Last language I have tried is Ruby companion, Python, which behave more elegantly than C or Java:</p>
<figure class="highlight"><pre><code class="language-python" data-lang="python"><span class="k">def</span> <span class="nf">shift</span><span class="p">(</span><span class="n">no</span><span class="p">)</span> <span class="p">:</span>
<span class="k">return</span> <span class="p">(</span><span class="mi">2</span> <span class="o">&lt;&lt;</span> <span class="p">(</span><span class="n">no</span> <span class="o">-</span> <span class="mi">1</span><span class="p">),</span> <span class="p">(</span><span class="mi">2</span> <span class="o">&lt;&lt;</span> <span class="n">no</span><span class="p">)</span> <span class="o">/</span> <span class="mi">2</span><span class="p">)</span>
<span class="k">print</span> <span class="n">shift</span><span class="p">(</span><span class="mi">8</span><span class="p">)</span> <span class="c"># (256, 256)</span>
<span class="k">print</span> <span class="n">shift</span><span class="p">(</span><span class="mi">1</span><span class="p">)</span> <span class="c"># (2, 2)</span>
<span class="k">print</span> <span class="n">shift</span><span class="p">(</span><span class="mi">0</span><span class="p">)</span> <span class="c"># ValueError: negative shift count</span></code></pre></figure>
<p>It actually checked for nonsensical value and raised an exception not giving me silently wrong (from my perspective) value.
Sadly Python does not behave similarly for other basic mistakes, as:</p>
<figure class="highlight"><pre><code class="language-python" data-lang="python"><span class="o">&gt;&gt;&gt;</span> <span class="mi">2</span> <span class="o">&lt;</span> <span class="s">"1"</span>
<span class="bp">True</span></code></pre></figure>
</content>
</entry>
<entry>
<title>Rakefile for dealing with mercurial subrepos</title>
<link href="http://rkj.github.com//mercurial/rake/subrepositories/2010/01/16/Mercurial-subrepos.html"/>
<updated>2010-01-16T00:00:00+00:00</updated>
<id>http://rkj.github.com//mercurial/rake/subrepositories/2010/01/16/Mercurial-subrepos</id>
<content type="html"><p>Subrepositories are quite an important things if you have any bigger project, where you can extract some libraries, modules, parts or any other such things. We have chosen Mercurial for version control, because it distributed (svn suxx ass), easy to use, and have good cross-platform support. Until version 1.3 it did not have any support for subrepositories, now it has one.
It is far from perfect, but still much better than nothing. And I am sure it will get better, or eventually great.</p>
<p>But for the time I have made quite simple Rakefile to help with subrepos. It works fine also with Mercurial versions less than 1.3. To use it you must have Ruby and gems: rake, term-ansicolor installed. Script assumes that you have subdirectories which are Mercurial repositiories and issues commands on each of them.</p>
<p><code class="highlighter-rouge">rake -T</code> will give you list of available tasks:</p>
<ul>
<li>hg - generic task. You can invoke it like: <code class="highlighter-rouge">CMD=status rake hg</code> and it will invoke CMD on every repository. In this case it will display status.</li>
<li>pull, push, status - does what the name says.</li>
<li>update - does pull -u.</li>
<li>rebase - does pull –rebase. You must have <code class="highlighter-rouge">hgext.rebase=</code> enabled for it to work.</li>
<li>tag - invoked <code class="highlighter-rouge">rake tag name=0.1</code> will tag each repository with tag 0.1.</li>
<li>upto - invoked <code class="highlighter-rouge">rake upto rev=0.2</code> will do <code class="highlighter-rouge">hg up -r 0.2</code>. It makes sense with branches and tags, not so much with revisions, because each subrepo have its own history ;-).</li>
<li>commit - if you have main repository, that contains all the others (as when you use Mercurial subrepo support) it will read commit messages from all the subrepos since last commit to the main repo and copy them into clipboard (works on a Mac only, I think).</li>
</ul>
<figure class="highlight"><pre><code class="language-ruby" data-lang="ruby"><table style="border-spacing: 0"><tbody><tr><td class="gutter gl" style="text-align: right"><pre class="lineno">1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90</pre></td><td class="code"><pre><span class="k">begin</span>
<span class="nb">require</span> <span class="s1">'term/ansicolor'</span>
<span class="k">class</span> <span class="nc">String</span>
<span class="kp">include</span> <span class="no">Term</span><span class="o">::</span><span class="no">ANSIColor</span>
<span class="k">end</span>
<span class="k">rescue</span> <span class="no">LoadError</span> <span class="o">=&gt;</span> <span class="n">e</span>
<span class="no">STDERR</span><span class="p">.</span><span class="nf">puts</span> <span class="s2">"Please install gem term-ansicolor to use this rakefile."</span>
<span class="nb">exit</span><span class="p">(</span><span class="mi">1</span><span class="p">)</span>
<span class="k">end</span>
<span class="vi">@repos</span> <span class="o">=</span> <span class="no">Dir</span><span class="p">[</span><span class="s2">"**/.hg"</span><span class="p">].</span><span class="nf">map</span> <span class="p">{</span> <span class="o">|</span><span class="n">f</span><span class="o">|</span> <span class="n">f</span><span class="p">.</span><span class="nf">sub</span><span class="p">(</span><span class="sr">/.hg$/</span><span class="p">,</span> <span class="s2">""</span><span class="p">)</span> <span class="p">}</span>
<span class="k">def</span> <span class="nf">on_repos</span><span class="p">(</span><span class="n">cmd</span><span class="p">)</span>
<span class="vi">@repos</span><span class="p">.</span><span class="nf">each</span> <span class="k">do</span> <span class="o">|</span><span class="n">project</span><span class="o">|</span>
<span class="k">begin</span>
<span class="nb">puts</span> <span class="s2">"= </span><span class="si">#{</span><span class="n">project</span><span class="p">.</span><span class="nf">red</span><span class="si">}</span><span class="s2"> (</span><span class="si">#{</span><span class="n">cmd</span><span class="p">.</span><span class="nf">yellow</span><span class="si">}</span><span class="s2">)"</span>
<span class="nb">system</span> <span class="s2">"cd </span><span class="si">#{</span><span class="n">project</span><span class="si">}</span><span class="s2">; </span><span class="si">#{</span><span class="n">cmd</span><span class="si">}</span><span class="s2">"</span>
<span class="k">rescue</span> <span class="o">=&gt;</span> <span class="n">e</span>
<span class="nb">puts</span> <span class="s2">"Failed because: </span><span class="si">#{</span><span class="n">e</span><span class="si">}</span><span class="s2">"</span><span class="p">.</span><span class="nf">red</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Invokes any hg command on every repo. Provide CMD=command env variable."</span>
<span class="n">task</span> <span class="ss">:hg</span> <span class="k">do</span>
<span class="k">if</span> <span class="no">ENV</span><span class="p">[</span><span class="s1">'CMD'</span><span class="p">].</span><span class="nf">nil?</span>
<span class="nb">puts</span> <span class="s2">"Missing rev parameter"</span>
<span class="k">else</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg </span><span class="si">#{</span><span class="no">ENV</span><span class="p">[</span><span class="s1">'CMD'</span><span class="p">]</span><span class="si">}</span><span class="s2">"</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Rebase"</span>
<span class="n">task</span> <span class="ss">:rebase</span> <span class="k">do</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg pull --rebase"</span><span class="p">)</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Pull"</span>
<span class="n">task</span> <span class="ss">:pull</span> <span class="k">do</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg pull"</span><span class="p">)</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Pull -u"</span>
<span class="n">task</span> <span class="ss">:update</span> <span class="k">do</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg pull -u"</span><span class="p">)</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Status"</span>
<span class="n">task</span> <span class="ss">:status</span> <span class="k">do</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg status"</span><span class="p">)</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Push"</span>
<span class="n">task</span> <span class="ss">:push</span> <span class="k">do</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg push"</span><span class="p">)</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Tag repos. Provide name=tag_name param"</span>
<span class="n">task</span> <span class="ss">:tag</span> <span class="k">do</span> <span class="o">|</span><span class="n">t</span><span class="p">,</span> <span class="n">args</span><span class="o">|</span>
<span class="k">if</span> <span class="n">args</span><span class="p">.</span><span class="nf">name</span><span class="p">.</span><span class="nf">nil?</span>
<span class="nb">puts</span> <span class="s2">"Missing name parameter"</span>
<span class="k">else</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg tag </span><span class="si">#{</span><span class="n">args</span><span class="p">.</span><span class="nf">name</span><span class="si">}</span><span class="s2">"</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Update all repos to some revision. Provide rev=revision parameter."</span>
<span class="n">task</span> <span class="ss">:upto</span> <span class="k">do</span> <span class="o">|</span><span class="n">t</span><span class="p">,</span> <span class="n">args</span><span class="o">|</span>
<span class="k">if</span> <span class="n">args</span><span class="p">.</span><span class="nf">rev</span><span class="p">.</span><span class="nf">nil?</span>
<span class="nb">puts</span> <span class="s2">"Missing rev parameter"</span>
<span class="k">else</span>
<span class="n">on_repos</span><span class="p">(</span><span class="s2">"hg update -C </span><span class="si">#{</span><span class="n">args</span><span class="p">.</span><span class="nf">name</span><span class="si">}</span><span class="s2">"</span><span class="p">)</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Commits main repository and adds commits messages from every subrepo."</span>
<span class="n">task</span> <span class="ss">:commit</span> <span class="k">do</span>
<span class="nb">require</span> <span class="s1">'time'</span>
<span class="n">last_date</span> <span class="o">=</span> <span class="no">Time</span><span class="p">.</span><span class="nf">parse</span><span class="p">(</span><span class="sx">%x{hg tip --template '{date|isodate}'}</span><span class="p">)</span> <span class="o">+</span> <span class="mi">60</span>
<span class="n">date</span> <span class="o">=</span> <span class="n">last_date</span><span class="p">.</span><span class="nf">strftime</span><span class="p">(</span><span class="s2">"%Y-%m-%d %H:%M"</span><span class="p">)</span>
<span class="n">out</span> <span class="o">=</span> <span class="vi">@repos</span><span class="p">.</span><span class="nf">reject</span> <span class="p">{</span> <span class="o">|</span><span class="n">r</span><span class="o">|</span> <span class="no">File</span><span class="p">.</span><span class="nf">basename</span><span class="p">(</span><span class="n">r</span><span class="p">)</span> <span class="o">==</span> <span class="s2">"itiner"</span> <span class="p">}.</span><span class="nf">map</span> <span class="k">do</span> <span class="o">|</span><span class="n">r</span><span class="o">|</span>
<span class="n">log</span> <span class="o">=</span> <span class="sx">%x{cd </span><span class="si">#{</span><span class="n">r</span><span class="si">}</span><span class="sx"> &amp;&amp; hg log -d '&gt;</span><span class="si">#{</span><span class="n">date</span><span class="si">}</span><span class="sx">' --template="== {desc}</span><span class="se">\n</span><span class="sx">"}</span><span class="p">.</span><span class="nf">strip</span>
<span class="n">log</span><span class="p">.</span><span class="nf">empty?</span> <span class="p">?</span> <span class="kp">nil</span> <span class="p">:</span> <span class="s2">"= </span><span class="si">#{</span><span class="no">File</span><span class="p">.</span><span class="nf">basename</span> <span class="n">r</span><span class="si">}</span><span class="se">\n</span><span class="si">#{</span><span class="n">log</span><span class="si">}</span><span class="s2">"</span>
<span class="k">end</span>
<span class="n">msg</span> <span class="o">=</span> <span class="n">out</span><span class="p">.</span><span class="nf">compact</span><span class="p">.</span><span class="nf">join</span><span class="p">(</span><span class="s2">"</span><span class="se">\n\n</span><span class="s2">"</span><span class="p">).</span><span class="nf">strip</span>
<span class="c1"># mac only?</span>
<span class="no">IO</span><span class="p">.</span><span class="nf">popen</span><span class="p">(</span><span class="s1">'pbcopy'</span><span class="p">,</span> <span class="s1">'w'</span><span class="p">)</span> <span class="k">do</span> <span class="o">|</span><span class="n">pipe</span><span class="o">|</span>
<span class="n">pipe</span><span class="p">.</span><span class="nf">print</span><span class="p">(</span><span class="n">msg</span><span class="p">)</span>
<span class="k">end</span>
<span class="nb">system</span> <span class="s2">"hg commit &amp;&amp; hg push"</span>
<span class="k">end</span><span class="w">
</span></pre></td></tr></tbody></table></code></pre></figure>
</content>
</entry>
<entry>
<title>WYSIWYG in LaTeX with Rakefile</title>
<link href="http://rkj.github.com//latex/rake/2010/01/15/LaTeX-with-Rakefile.html"/>
<updated>2010-01-15T00:00:00+00:00</updated>
<id>http://rkj.github.com//latex/rake/2010/01/15/LaTeX-with-Rakefile</id>
<content type="html"><p>First tools to work with text I have learned where WYSIWYG and everywhere I looked they were praised. I really did not know the alternative and could not think a reason for it to exist in time of fast computers.
But when I was exposed to LaTex I have instantly fell in love. Compared to Word it produces astonishingly beautiful output, takes whole of issues away (numbering, chapters, pages with one sentence on it, positioning pictures and many, many more) and lets you focus on content (and structure) not on the final look (just like WriteBoard, I am using right now). But it has some issues. Actually my relationship with LaTeX is more like love-hate one. Sooner or later you will want to do some non-trivial thing to do. And you will either have to dig really deeply or find someone who did it before - and neither one may be easy, even on the shoulders of google.
But knowing many its faults I still do know imagine writing a thesis in some word processor instead of LaTeX - that would be just plain stupid…</p>
<p>I would like to share with you a simple script that makes life with LaTeX a little nicer. It is a Rakefile, so you need to have Ruby and Rake gem installed. It contains some simple tasks:</p>
<ul>
<li>compile - creates PDF document from the source files. It makes a couple of runs too much, but just to be sure that everything is on the right place, with bibliography, table of content, etc.</li>
<li>clean - deletes all the messy and unneeded files if you would like to zip folder and send it somewhere.</li>
<li>view - tries to display PDF in some reader available on your system.</li>
<li>run - the most interesting part. I have stolen a concept from autotest. It checks the source files every second and if there are any changes it runs compile. If you have a sane PDF reader (like Sumatra on Windows, or Skim on Mac - not Adobe Reader) then it will refresh pdf as soon as it is ready. So you can have this rake task running in the background, and every time you hit save you will see changes in the pdf.</li>
</ul>
<p>I consider this to be the best of both worlds. You can write freely for most time, but when you are finishing and want to take care of the final look you can almost instantly see changes in PDF and tweek quirks until you are satisfied.</p>
<p>There are other tools to acomplish similar results. One of them is LyX, but there is a problem with retrieving source tex files from it. And you will need them at some point, to do some tweeking, or something that is not available in LyX, but can be done in LaTeX and it will be painful. So do not even start - its lame. There are also some editors with similar capabilities, but I have my favorite editor, you may have yours - and the scripts works with them all.</p>
<figure class="highlight"><pre><code class="language-ruby" data-lang="ruby"><table style="border-spacing: 0"><tbody><tr><td class="gutter gl" style="text-align: right"><pre class="lineno">1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56</pre></td><td class="code"><pre><span class="no">TEX_FILES</span> <span class="o">=</span> <span class="no">FileList</span><span class="p">[</span><span class="s2">"*.tex"</span><span class="p">,</span> <span class="s2">"figures/*"</span><span class="p">,</span> <span class="s2">"*.bib"</span><span class="p">,</span> <span class="s2">"*.cls"</span><span class="p">]</span>
<span class="no">MAIN</span> <span class="o">=</span> <span class="s2">"thesis"</span> <span class="c1"># name of main file without .tex suffix</span>
<span class="no">MAIN_TEX</span> <span class="o">=</span> <span class="s2">"</span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="s2">.tex"</span>
<span class="no">OUT_PDF</span> <span class="o">=</span> <span class="s2">"</span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="s2">.pdf"</span>
<span class="n">file</span> <span class="no">OUT_PDF</span> <span class="o">=&gt;</span> <span class="no">TEX_FILES</span> <span class="k">do</span>
<span class="sx">%x{pdflatex -interaction=batchmode </span><span class="si">#{</span><span class="no">MAIN_TEX</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{makeindex </span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{bibtex </span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{pdflatex -interaction=batchmode </span><span class="si">#{</span><span class="no">MAIN_TEX</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{bibtex </span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="sx">.gls &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{makeindex </span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{pdflatex -interaction=batchmode </span><span class="si">#{</span><span class="no">MAIN_TEX</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{makeindex </span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{bibtex </span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{bibtex </span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="sx">.gls &gt;/dev/null 2&gt;&amp;1}</span>
<span class="sx">%x{pdflatex -interaction=batchmode </span><span class="si">#{</span><span class="no">MAIN_TEX</span><span class="si">}</span><span class="sx"> &gt;/dev/null 2&gt;&amp;1}</span>
<span class="n">sh</span> <span class="s2">"pdflatex -interaction=batchmode </span><span class="si">#{</span><span class="no">MAIN_TEX</span><span class="si">}</span><span class="s2"> &gt;/dev/null"</span>
<span class="n">cp</span> <span class="s2">"</span><span class="si">#{</span><span class="no">MAIN</span><span class="si">}</span><span class="s2">.pdf"</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Removes unnecessery files"</span>
<span class="n">task</span> <span class="ss">:clean</span> <span class="k">do</span>
<span class="n">rm</span> <span class="no">FileList</span><span class="p">[</span><span class="s2">"*.aux"</span><span class="p">,</span><span class="s2">"*.bak"</span><span class="p">,</span><span class="s2">"*.log"</span><span class="p">,</span><span class="s2">"*.blg"</span><span class="p">,</span> <span class="s2">"*.bbl"</span><span class="p">,</span> <span class="s2">"*.toc"</span><span class="p">,</span> <span class="s2">"*.out"</span><span class="p">,</span> <span class="s2">"*.idx"</span><span class="p">,</span> <span class="s2">"*.ilg"</span><span class="p">,</span> <span class="s2">"*.ind"</span><span class="p">,</span> <span class="s2">"</span><span class="si">#{</span><span class="no">THESIS</span><span class="si">}</span><span class="s2">.pdf"</span><span class="p">,</span> <span class="s2">"*.dep"</span><span class="p">,</span> <span class="s2">"*.glo"</span><span class="p">,</span> <span class="s2">"*.gls"</span><span class="p">]</span>
<span class="k">end</span>
<span class="n">task</span> <span class="ss">:default</span> <span class="o">=&gt;</span> <span class="p">[</span><span class="ss">:run</span><span class="p">]</span>
<span class="s2">"Open text editor to edit files"</span>
<span class="n">task</span> <span class="ss">:edit</span> <span class="k">do</span>
<span class="p">[</span><span class="s2">"$EDITOR"</span><span class="p">,</span> <span class="s2">"mate"</span><span class="p">,</span> <span class="s2">"vim"</span><span class="p">,</span> <span class="s2">"emacs"</span><span class="p">].</span><span class="nf">find</span> <span class="k">do</span> <span class="o">|</span><span class="n">editor</span><span class="o">|</span>
<span class="nb">system</span> <span class="s2">"</span><span class="si">#{</span><span class="n">editor</span><span class="si">}</span><span class="s2"> *tex *bib Rakefile"</span>
<span class="k">end</span> <span class="n">or</span> <span class="nb">puts</span> <span class="s2">"Unable to find text editor."</span>
<span class="k">end</span>
<span class="n">task</span> <span class="ss">:run</span> <span class="k">do</span>
<span class="n">t</span> <span class="o">=</span> <span class="no">Rake</span><span class="o">::</span><span class="no">Task</span><span class="p">[</span><span class="no">OUT_PDF</span><span class="p">]</span>
<span class="kp">loop</span> <span class="k">do</span>
<span class="k">begin</span>
<span class="n">t</span><span class="p">.</span><span class="nf">reenable</span>
<span class="n">t</span><span class="p">.</span><span class="nf">invoke</span>
<span class="k">rescue</span> <span class="o">=&gt;</span> <span class="n">e</span>
<span class="nb">puts</span> <span class="s2">"Error: </span><span class="si">#{</span><span class="n">e</span><span class="si">}</span><span class="s2">, sleeping for 15 seconds."</span>
<span class="no">Kernel</span><span class="p">.</span><span class="nf">sleep</span> <span class="mi">15</span>
<span class="k">end</span>
<span class="no">Kernel</span><span class="p">.</span><span class="nf">sleep</span> <span class="mi">1</span>
<span class="k">end</span>
<span class="k">end</span>
<span class="n">desc</span> <span class="s2">"Shows Thesis"</span>
<span class="n">task</span> <span class="ss">:view</span> <span class="o">=&gt;</span> <span class="p">[</span><span class="no">OUT_PDF</span><span class="p">]</span> <span class="k">do</span>
<span class="p">[</span><span class="s2">"open"</span><span class="p">,</span> <span class="s2">"okular"</span><span class="p">,</span> <span class="s2">"kpdf"</span><span class="p">,</span> <span class="s2">"acroread"</span><span class="p">].</span><span class="nf">find</span> <span class="k">do</span> <span class="o">|</span><span class="n">viewer</span><span class="o">|</span>
<span class="nb">system</span> <span class="s2">"</span><span class="si">#{</span><span class="n">viewer</span><span class="si">}</span><span class="s2"> </span><span class="si">#{</span><span class="no">OUT_PDF</span><span class="si">}</span><span class="s2">"</span>
<span class="k">end</span> <span class="n">or</span>
<span class="nb">puts</span> <span class="s2">"Unable to find any pdf viewer."</span>
<span class="k">end</span><span class="w">
</span></pre></td></tr></tbody></table></code></pre></figure>
</content>
</entry>
</feed>
If you would like to create a banner that links to this page (i.e. this validation result), do the following:
Download the "valid Atom 1.0" banner.
Upload the image to your own server. (This step is important. Please do not link directly to the image on this server.)
Add this HTML to your page (change the image src
attribute if necessary):
If you would like to create a text link instead, here is the URL you can use:
http://www.feedvalidator.org/check.cgi?url=http%3A//feeds.feedburner.com/RkjTechBlog