This is an old revision of the document!
A monthly budgeting application, to plan a budget and stick to it.
The planning process can easily be done with a spreadsheet for now, but budget analysis is going to be time-consuming that way. So, the most beneficial application to write would be a simple budget tracker, which uses something like a naive bayesian classifier to allocate a category to each transaction and then produce monthly totals in different categories.
A transaction can be expected to consist of the following minimum bits of information:
Transactions may also optionally include:
All of these items can be used in the classification of the transaction, but first they must be transformed into appropriate tokens. This is done separately for each item as follows:
|Date|| The month, day of the month, day of the week and the nth occurrence of that day within the month are all converted into tokens. For example, 2013-03-27 might yield tokens
|Description|| The description is split into tokens of alphanumerics using any other character as a separator and forced to lowercase. For example, the string
These tokens are concatenated into a single list and these are all treated equally for the purposes of the classifier. This allows the classifier to learn which are the reliable indicators of any particular categorisation.
The previous contents of this page, for posterity.
: Delete this
Also contains design notes for the pysf library.