Archive for June, 2021

What skills does a financial software developer need?

The below network diagram is intended as an outline of the skill set required for a financial software developer.
Note:

  • Most individuals should aim to have a strong core. Think of it like a pyramid, where the height is the strength of a skill. Core skills like general computing principles, probability, communication should be built “tall” and very strong.  peripheral skills such as python/monitoring will be weaker. An individual will typically only learn 1 or 2 niche areas strongly (T shaped)
  • Notice the rough relative sizes of the areas. 55% computing, 10% math, 15% soft skills, 20% finance. This is intended to represent the rough allocation of effort.
    • If you only bring 95% techinical skills, you are going to waste time building the wrong thing, build something no one wants or build something useful but no one will know as you haven’t the soft skills to sell it.
  • The management branch on the bottom left is optional.

Financial Software Developer Skill Mindmap

Computing

Skill Topic Sub-Topic Links Requirement
basics
linux surrey Change Directories, edit config files, kill processes, copy/move files, check disk space.
bash tldp.org Write a script to periodically sync a directory between servers and schedule it using cron.
git git-scm Checkout, branch, commit, push code.
Common Tools: jira/jenkins user-stories Write a good jira, assign owner. Kick off a build on a common CD platform.
Programming Language Knowledge of 2 different programming paradigms.
kdb kdb-tree Write efficient selects for pulling back a subset of data.
python Download data from a REST api, calculate average/mean/median for certain metrics.
java book Write a java program to count the number  of words in a file.
Databases Be aware of the major types of database available and when to use each.
kdb kdb-tree
mysql/postgresql/oracle/ms Know standard SQL.
Software Engineeering peopleware How to grow good software.
Good Software Properties of good software with examples.
Architecture Common Enterprise software patterns.
Distributed Systems Difficulties with distributed systems and common patterns to solve them.
Data Processing Pipelines Common processing Pipeline Patterns
Site Reliability Engineering SRE How reliable should software be?
Metrics Ccommon metrics used to measure reliability and when to use each
Monitoring What monitoring systems/tools are available? What are monitoring best practices?
Releases Accelerate
Support Handling outages. Engaging with users.
Software Concepts Testing Testing Methods and knowledge of one test framework
User Interfaces dont-think What makes a good user interface?
Networking How computers connect. Expected latency/bandwidth.
TCP/IP ports, switches, racks, data centres, windows.
Middleware Messaging midddleware: solace/JMS/kafka/MQ.
Software Development
agile
scrum book Sprints, iterations, standups, restrospectives, story-time.
Lean Development lean-startup When, why and how to develop lean.
Code Reviews pragma Code review best practices.

Soft Skills

Skill Topic Sub-Topic Links Requirement
IT Skills
excel Create a table with conditional formatting, calculate sum/average of column, use vlookup
outlook Filter emails, create meetings.
Communication
Emails How to write an email to users, team mates, managers, senior management.
Meetings atlassian What is a meeting meant to accomplish? How to achieve that.
powerpoint Prepare a presentation for management.
visio Draw an architectural diagram of your system.
Sales
Marketing Traction How to get your software used and appreciate by more users.
Support
Networking Building a network to get things done.
Management Grove
Building a Team Dysfunctions
One to Ones What makes a good one-to-one
Interviews How to evaluate cnadidates effectively.
Quality How to ensure quality of product.
Organizational Structures phoenix Different structures for management.
Project Management
Roadmaps
Decision Making Which approach to decision making to  use when.

kdd 2019 Roadmap – Cloud and Community

This article was drafted in 2019, given it’s 2021 it made an interesting look back and sadly still a look forward….

There have been two big changes in the software world:

  • The Cloud
  • Community collaboration

Winners and Losers

From these shifts, there have been winners and losers

  • Community helped Wikipedia build the best encyclopaedia
    relegating Britannica and Encarta to history.
  • Community helped linux become the dominant operating system
    Solaris/OS2 systems are now, only used in legacy niches
  • Community developed python is replacing matlab
  • Cloud has seen atlassian/github/amazon/salesforce etc . win by offering SAAS solutions
    to replace what would previously have been locally installed software (SAP/perforce)
  • Cloud hosted Gmail/Hotmail has replaced companies running their own mail servers

If kdb doesn’t change it will become a legacy platform with developers maintaining legacy systems that over time will be replaced with modern cloud alternatives.

Therefore we are starting two initiatives:

Cloud native KDB

  • A fully-managed time-series database hosted on google cloud
  • Able to be signed up for and used within 10 minutes
  • Clear predictable pricing based on storage and query usage
  • Hiding all the complexity of kdb (no par.txt/segments/sym file manipulation)
  • While providing access to the speed and expressiveness of the language
  • Taking advantage of modern load balancing (kubernetes) And cheap storage (S3)

We have a skunkworks team based in their own office, tasked with making a kdb database cloud solution so reliable and feature rich your kdb expert can now stop working to keep the database running and instead focus on business problems.

Community Driven q

We want kdb to run everywhere, for the barriers to adoption to drop and for the language to expand what it can do. A new kdb user will be able to run kdb on their machine through their standard package manager and to access a whole library of utilities to help them with whatever task they are trying to achieve.

KDB everywhere

To do this, we’ve formed a committee including representatives from finance/education and the wider community to:

  • Open source the q language
    • Development possibilities will be opened up to the wider community as anyone can submit ideas or even PRs for experimental functionality
    • Being open source allows kdb to be bundled with linux and we see this as allowing wider use of q scripts
  • Create a hosted packaging system that allows reusing code easily similar to NPM/maven
    • Providing a wider library of community maintained packages that are easily reusable
    • Work with aquaq to migrate parts of their torq framework to provide a kdb standard library
    • Work with the community to onboard some of their code as packages
      e.g. TimeStored is donating qunit
  • Provide a recommended SDLC for kdb. Over the years we’ve developed processes for end to end development of q code at scale and we will be providing that same tooling to everyone.

By both open sourcing the language and allowing easier development of shareable packages we accelerate the pace at which kdb can help all developers solve problems and share solutions. Making the kdb platform stronger for everyone.

The Future of kdb is with you

It’s an exciting time and the demand for storing and analysing large time-series is growing. We believe by becoming cloud first and community driven we can continue to provide solutions for many years to come.

kdb – Feature Wishlist

Features I want:

  1. Open Sourced kdb (a person can dream). As one of the top 5 tools in my programmers toolbox it’s frustrating that kdb is closed source. I can’t use the tool everywhere and at any time the price can be increased.
  2. Increase ease of Use
    • Block user queries that will obviously kill the database (select from quote).
    • Do not quit out when a query takes too much memory (-w exceeded or all RAM/swap on box gone.). Sensibly return an error and keep going.
  3. Faster Speed – Admittedly this isn’t a strong requirement for any work I do but it irritates me as a programmer to know some easy 10x speed improvements are not being used.
    • Perform warmup queries and counts on startup automatically to get most recent data into memory.
    • Replace the kdb/q code with CPU vector functions
    • Parse the user query and optimize it. If a user sends “select from trade where a=1,b=2,c=3,d=3” automatically order the evaluation of the where clause to at leaast prioritize those with attributes.
  4. Marketing – I didn’t think this would be on my wishlist…but if you can market kdb better I would love to stop having people suggest I use mongodb/hadoop/latestFad when kdb is a great fit for the problem at hand.