This is a "sandbox" post which I use as a brain dump for code I find interesting. You know that cool stuff that, because you might not use all the time, it's easy to forget.
For the code and the tests (always test, I'm serious), look at my sandbox repo on GitHub:
https://github.com/zom-pro/sandbox.
This post is mainly inspired by those moments when you look at code you wrote 6 months ago and want to slap yourself. Normally, you hope nobody who knows you will ever see that code either. This happens to me sometimes when I discover a cool new (maybe a bit obscure) data structure in a language.
In this post I'll show a couple of things that can be done with the deque and defaultdict classes from Python 3.4's collections library. There are other interesting objects in collections, such as OrderedDict, but they are quite straightforward. For more info, just take a look at the official collections docs.
For me, there are two main reasons to reach for slightly more advanced data structures: performance and code readability.
defaultdict
This data structure comes in handy when you have to generate missing keys in dicts. Let's look at a comparison between how I used to do it and how I do it now. These are great examples of improved readability.
sparse matrix
Let's say I want to create a sparse matrix representation using a dictionary. The key would be a (row, column) tuple, and if it doesn't have a value associated with it, it will return 0. In the past, I would've done something like this:
class MyDict():
    def __init__(self, default_value):
        # Default value to be used when a key isn't found.
        self.default_value = default_value
        self._d = dict()

    def __setitem__(self, key, value):
        return self._d.__setitem__(key, value)

    def __getitem__(self, key):
        try:
            return self._d.__getitem__(key)
        except KeyError:
            return self.default_value

    def __delitem__(self, key):
        return self._d.__delitem__(key)

my_dict = MyDict(0)
But what about using defaultdict instead?
from collections import defaultdict

def default():
    return 0

my_dict = defaultdict(default)
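By the way, int() already returns 0, so defaultdict(int) does the same job without the helper function. A quick usage sketch, with made-up values:

from collections import defaultdict

# Sparse matrix: keys are (row, column) tuples, missing cells read as 0.
matrix = defaultdict(int)
matrix[(0, 0)] = 5
matrix[(2, 1)] = 7
print(matrix[(0, 0)])  # 5
print(matrix[(1, 1)])  # 0, this key was never set

One difference from MyDict worth knowing: reading a missing key from a defaultdict actually inserts it (with the default value), while MyDict leaves the underlying dict untouched.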
class MyDict():
def __init__(self, default_value):
# default value to be used when key isn't found.
self.default_value = default_value
self._d = dict()
def __setitem__(self, key, value):
return self._d.__setitem__(key, value)
def __getitem__(self, key):
try:
return self._d.__getitem__(key)
except KeyError:
return self.default_value
def __delitem__(self, key):
return self._d.__delitem__(key)
my_dict = MyDict(0)
But what about using defaultdict
def default():
return 0
my_dict = defaultdict(default);
counting appearances
before collections...
def counting_appearance_old(sample):
    d = {}
    for key in sample:
        if key not in d.keys():
            d[key] = 0
        d[key] += 1
    return d
after collections...
def counting_appearance(sample):
    d = defaultdict(int)
    for k in sample:
        d[k] += 1
    return d
Not an enormous difference, but no doubt more straightforward.
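A quick sanity check with a made-up sample:

print(counting_appearance("abracadabra"))
# defaultdict(<class 'int'>, {'a': 5, 'b': 2, 'r': 2, 'c': 1, 'd': 1})
# (key order may vary)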
a dictionary of lists, like the example in the docs?
old school...
def dict_of_lists_old(sample):
    d = {}
    for k, v in sample:
        if k not in d.keys():
            d[k] = []
        d[k].append(v)
    return d
new generation...
def dict_of_lists(sample):
    d = defaultdict(list)
    for k, v in sample:
        d[k].append(v)
    return d
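And a quick run, using the same sample data as the defaultdict example in the official docs:

pairs = [('yellow', 1), ('blue', 2), ('yellow', 3), ('blue', 4), ('red', 1)]
print(dict_of_lists(pairs))
# defaultdict(<class 'list'>, {'yellow': [1, 3], 'blue': [2, 4], 'red': [1]})
# (key order may vary)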
deque
moving average
This one is more commonly used (or at least I've seen it more often) to implement something like the Unix tail filter. However, it comes in handy when calculating moving averages and other similar "moving" calculations. Let's calculate the moving average of a list with a moving window of 3 numbers (here it's crazy important to test, take a look at my GitHub).
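For the curious, the tail idea is roughly the recipe from the official collections docs; a minimal sketch:

from collections import deque

def tail(filename, n=10):
    # A deque with maxlen=n keeps only the last n lines;
    # older lines are silently dropped from the left as new ones arrive.
    with open(filename) as f:
        return deque(f, maxlen=n)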
without collections.deque, I would have done something like this:
def old_moving_average(iterable, n=3):
    len_iterable = len(iterable)
    for e, _ in enumerate(iterable):
        # Necessary to stop before reaching the end.
        if e <= len_iterable - n:
            yield sum(iterable[e:n + e]) / n
with deque:
import itertools
from collections import deque

def moving_average(iterable, n=3):
    it = iter(iterable)
    # Pre-load the deque with the first n - 1 elements, plus a leading 0.
    d = deque(itertools.islice(it, n - 1))
    d.appendleft(0)
    s = sum(d)
    for elem in it:
        # Keep a running sum: add the newest element, drop the oldest.
        s += elem - d.popleft()
        d.append(elem)
        yield s / n
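A quick check that both versions agree (sample data borrowed from the moving_average recipe in the official docs):

data = [40, 30, 50, 46, 39, 44]
print(list(old_moving_average(data)))  # [40.0, 42.0, 45.0, 43.0]
print(list(moving_average(data)))      # [40.0, 42.0, 45.0, 43.0]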
Well, readability hasn't improved as far as I can see. It actually involves a bunch of stuff that wasn't there before! So why on earth would I use it?
Well, I've done this type of calculation with finance-related stuff, and sample sizes normally explode quite quickly. So let's look at performance and you will see why. The old version recomputes sum() over a fresh list slice at every step, while the deque version keeps a running sum and does just one addition and one subtraction per element. Running timeit with number=1000 and increasingly larger lists, the results (times in seconds) are below; a minimal sketch of the timing harness comes after the numbers:
Length deque: 100 time: 0.20786042507368704
Length normal: 100 time: 0.4498787968021777
Length deque: 1000 time: 2.0155418775699676
Length normal: 1000 time: 4.767286307571519
Length deque: 10000 time: 20.771743648427435
Length normal: 10000 time: 28.09181877427863
Length deque: 100000 time: 53.21926311771493
Length normal: 100000 time: 140.31067571029496
yup...
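The harness I actually used is in my sandbox repo; what follows is just a minimal sketch of how numbers like these could be reproduced (the list sizes and number=1000 come from above, the rest is my assumption):

import timeit

for length in (100, 1000, 10000, 100000):
    setup = ("from __main__ import moving_average, old_moving_average; "
             "data = list(range(%d))" % length)
    t_deque = timeit.timeit("list(moving_average(data))", setup=setup, number=1000)
    t_old = timeit.timeit("list(old_moving_average(data))", setup=setup, number=1000)
    print("Length deque:", length, "time:", t_deque)
    print("Length normal:", length, "time:", t_old)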
As always, I'm aware things can be done differently and results will vary. But as I said, this is just a brain dump; hopefully you saw something new here.