Tom Hipwell

Tom Hipwellhttps://tomhipwell.co/Recent content on Tom HipwellHugo -- gohugo.ioen-gbWed, 12 Jun 2024 16:10:56 +0000AGI Predictionshttps://tomhipwell.co/blog/on_agi_predictions/Wed, 12 Jun 2024 16:10:56 +0000https://tomhipwell.co/blog/on_agi_predictions/I really enjoyed the nonint post on timelines to AGI. Obviously James Betker is better placed than me to make an informed prediction, and he has inside information (he works for OpenAI) but there’s a couple of things that jump out at me if I read this prediction critically. Firstly, given that transformers are great general approximators of behaviour, it’s very difficult to falsify any predictions about AGI without having a very specific and testable definition of what AGI is that everyone agrees on.dspy unpacked: continuous prompt optimisationhttps://tomhipwell.co/blog/dspy/Fri, 26 Apr 2024 15:10:56 +0000https://tomhipwell.co/blog/dspy/Omar Khattab, Chris Potts, Matei Zaharia | 2023 | Paper | Github | Docs A lot of work with LLMs today involves working through a loop where you break a problem into steps, write a prompt for each step, then put the whole together by adjusting each prompt to feed into the next one. dspy simplifies this process. It gives you a framework to structure your pipeline - forcing you to architect the application so your program flow is split from the variable stuff - the prompts and model weights that get fed to the LLM.How many customer interviews are enough?https://tomhipwell.co/blog/how_many_customer_interviews/Thu, 18 Apr 2024 13:30:00 +0000https://tomhipwell.co/blog/how_many_customer_interviews/Counts of customer interviews seem to have become a bit of a vanity metric of late. A shorthand for product or decision quality, as if one automatically implies the other. I appreciate your sacrifice at the temple of customer research, but I worry that you may have wasted your time. Working out the right number of interviews, wireframe tests or customers in the alpha phase of your project is quite similar to an optimal stopping problem.llm.c: The genius of Andrej Karpathyhttps://tomhipwell.co/blog/llm_c/Thu, 11 Apr 2024 20:19:00 +0000https://tomhipwell.co/blog/llm_c/What’s awesome about Andrej Karpathy’s llm.c isn’t just that it’s a bare-metal, from-scratch implementation of GPT-2 (safety wink definitely required!). If you take a step back, you’ll see he’s also educating us on how one of the very best in the world hones their craft. He’s stripped away the intermediate layer of libraries - there’s no PyTorch here. Instead, we’re taken back to the basics: an attempt to implement a simple C and CUDA version of GPT-2 in ~1000 lines with no dependencies or frameworks involved.March '24 Rounduphttps://tomhipwell.co/blog/march_24_roundup/Sun, 31 Mar 2024 11:20:00 +0000https://tomhipwell.co/blog/march_24_roundup/March was the month we got Grok, OpenAI confirmed their strategy and we no longer needed to run on vibes alone as gpt-4 was displaced at the top of the leaderboards. An experiment was also kicked off to learn about the pricing power of the major LLM providers. One of the things I most enjoyed this month was the explosion of interest in LLM agents with the launch of Devin, the AI software engineer.Hot takes on Devin, the AI software engineerhttps://tomhipwell.co/blog/devin/Sun, 17 Mar 2024 15:10:56 +0000https://tomhipwell.co/blog/devin/I thought Devin from Cognition looked super cool this week, the UX feels like a glimpse of a new era. I wonder how deep the moat is though? 🤔 From staring a little bit too closely at the screenshots and videos I’ve seen so far, a hot take would be that it feels like most of the performance lift in the SWE benchmarks could come from a switch in prompting technique, i.Grandmaster-Level Chess Without Searchhttps://tomhipwell.co/reading/grandmaster_chess_without_search/Sat, 16 Mar 2024 08:55:30 +0000https://tomhipwell.co/reading/grandmaster_chess_without_search/Anian Ruoss, Grégoire Delétang, Sourabh Medapati, Jordi Grau-Moya, Li Kevin Wenliang, Elliot Catt, John Reid and Tim Genewein | Grandmaster Level Chess without Search | 2024 | Paper Walter Isaacson | Elon Musk | 2023 | Book Towards the end of Walter Isaacson’s biography of Elon Musk, there’s a description of a breakthrough with Tesla Autopilot: For years, Tesla’s Autopilot system relied on a rules-based approach. It took visual data from a car’s cameras and identified such things as lane markings, pedestrians, vehicles, traffic signals, and a set of rules in a range of eight cameras.February '24 Rounduphttps://tomhipwell.co/blog/february_24_roundup/Mon, 26 Feb 2024 11:20:00 +0000https://tomhipwell.co/blog/february_24_roundup/February feels like it’s gone in a blur. Hofy had a brilliant company retreat in Peniche, Portugal. Sora looks insane. Google returned to open source AI with the Gemma series while Mistral released a hosted, closed-source model. Here’s a few other things that caught the eye: Self-Discover, Google DeepMind Can we improve LLM reasoning by adjusting the way in which we prompt? Google DeepMind demonstrate an up-to 32% uplift in performance that transfers across LLMs (GPT-4, GPT-3.LLMs as classifiershttps://tomhipwell.co/reading/llms_as_classifiers/Fri, 09 Feb 2024 08:55:30 +0000https://tomhipwell.co/reading/llms_as_classifiers/Lefteris Loukas, Ilias Stogiannidis, Odysseas Diamantopoulos, Prodromos Malakasiotis, Stavros Vassos | 2023 | Paper When I’ve heard folks talking about AI strategy recently a common trope has been that things are moving so fast that it is better to hold off product investments until the pace of change slows and the stack starts to stabilise. Instead we should be focussing on the low hanging fruit from the productivity lifts of using chat assistants or GPTs.January '24 Rounduphttps://tomhipwell.co/blog/january_roundup/Tue, 30 Jan 2024 14:15:00 +0000https://tomhipwell.co/blog/january_roundup/End of month one already so its time for a round up. Here’s a few different bits I found interesting in January: Supervised fine-tuning (SFT), Niels Rogge All the steps in going from a base model to a useful assistant using supervised fine tuning. A slightly deeper run through of fine tuning Mistral-7B end to end, with the details coloured in - from hardware sizing to a run through the different PEFT approaches (PEFT is parameter efficient fine tuning, e.Things I'd like to learn in 2024https://tomhipwell.co/blog/2024_in_learning/Wed, 17 Jan 2024 13:19:00 +0000https://tomhipwell.co/blog/2024_in_learning/I guess like everyone in 2023, I’ve thought a lot about LLMs, LMMs and all the rest of it. As an interested bystander and casual observer, I thought I’d stake out three things that I’m curious to learn more about during the course of 2024 as I try and get that bit closer to the edge. If you have similar thoughts, can correct the gaps in my reasoning or are further along the curve and can signpost me to some good reading on these themes, I’d love it.2023 In Reviewhttps://tomhipwell.co/blog/2023_in_review/Thu, 21 Dec 2023 14:20:00 +0000https://tomhipwell.co/blog/2023_in_review/2023 was an incredible year in our industry, so I thought I’d look back and share the things I’ve loved reading, watching, learning and doing this year. Blogs The Github one on Copilot, a slow and high level reveal around how Copilot is put together. Also, Jaccard similarity ftw! LLM Patterns, Eugene Yan’s summary post back in the Summer described a bunch of reference architectures for an emerging field for the first time.Modern code review: a case study at Googlehttps://tomhipwell.co/reading/modern_code_review/Thu, 07 Dec 2023 10:59:10 +0000https://tomhipwell.co/reading/modern_code_review/Caitlin Sadowski, Emma Söderberg, Luke Church, Michal Sipko, Alberto Bacchelli | 2018 | Paper Benchmarks for Code Review It’s handy as we talk about code review to use some benchmarks that anchor our expectations for review performance in data. The best that I know of are in a 2018 paper from Google: “Modern Code Review: A Case Study at Google”. I find these helpful when breaking down qualiative feedback about reviews as if you can get the data, you can start to get a feel for where improvements in your review process can come from.What Great Looks Likehttps://tomhipwell.co/templates/what_great_looks_like/Fri, 08 Sep 2023 14:20:00 +0000https://tomhipwell.co/templates/what_great_looks_like/PreviewMarkdown What Great Looks Like: NAME - TITLE ⚠️ This is a work in progress, the steps to get this live are: (1) writes a first draft from this template, (2) we review it together in a 1:1 (3) we re-draft based on feedback (4) we share it with the organisation as a whole so everyone has clarity on your role and responsibilities. Why are we doing this? We’re trying to remove ambiguity, if we do the hard thing and get specific about what we expect from a role it can help us identify strengths (so we can work on amplifying them) and areas for improvement (so you can improve your skills).The SPACE of developer productivity: There's more to it than you thinkhttps://tomhipwell.co/reading/the_space_of_developer_productivity/Mon, 03 Jul 2023 19:01:30 +0000https://tomhipwell.co/reading/the_space_of_developer_productivity/Nicole Forsgren, Margaret-Anne Storey, Chandra Maddila, Thomas Zimmermann, Brian Houck, Jenna Butler | 2021 | Paper Summary I read the devex paper by a few of the same team recently so I went back and read the orignal SPACE paper as well. This one is a bit more famous, it was widely syndicated on publication in 2021 and the ideas behind SPACE have seeped into the general engineering consciousness over the past couple of years.Developer experience: What actually drives productivity?https://tomhipwell.co/reading/dev_ex_what_actually_drives_productivity/Thu, 29 Jun 2023 18:01:30 +0000https://tomhipwell.co/reading/dev_ex_what_actually_drives_productivity/Abi Noda, Margaret-Anne Storey, Nicole Forsgren, Michaela Greiler | 2023 | Paper Summary Devloper Experience or DX focuses on the lived experience of developers and the points of friction they encounter in their everyday work. In addition to improving productivity, Developer experience can drive business performance through increased efficiency, product quality and employee retention. Quite a few recent approaches in this space have focussed solely on engineering metrics to measure whether an engineering organisation can be classed as “elite”.Product Manager Hiring at a European Deep Tech Startuphttps://tomhipwell.co/blog/pm_experience/Tue, 16 May 2023 14:20:00 +0000https://tomhipwell.co/blog/pm_experience/Hiring Product Managers for Deep Tech I was catching up with a peer earlier this week and towards the end of our call, they threw out an interesting question - if you were hiring a product manager (PM) for our team right now, what profile would you hire? For context, the startup they worked for was a european deep tech startup. Their current team size is around 15, with 5-7 members of that team in engineering.Quick Technical Decisions: Trade-off, Pay-off, Decision, Communicationhttps://tomhipwell.co/blog/quick_technical_decisions/Tue, 09 May 2023 12:55:56 +0000https://tomhipwell.co/blog/quick_technical_decisions/A skill that a lot of engineers develop as they move towards a Senior level is the ability to make quick technical decisions verbally or in short form (say, a Slack post). Mastering this skill allows for triage: where do I need to pause and spend time working through or spiking the idea with peers, where can I just make the decision and keep moving under my own steam, and what should I set to one side for another time?High Output Managementhttps://tomhipwell.co/decks/hom/Tue, 02 May 2023 00:00:00 +0000https://tomhipwell.co/decks/hom/Andy Grove’s masterpiece on management and leadership was a book I reached for as I transitioned from a role as a senior IC to leading an engineering organisation for the first time. To improve my practice, I created this deck to break down the core lessons from the book and keep them at the forefront of my mind as I went about my day-to-day activities. The book provides guidance on effective communication, setting clear goals, and offers techniques for measuring and improving productivity.Abouthttps://tomhipwell.co/about/Fri, 28 Apr 2023 00:00:00 +0000https://tomhipwell.co/about/I lead technology at Hofy - a Series B startup revolutioning how we equip the world’s talent. I’m a technology leader with 15 years of engineering and product development experience at hypergrowth startups, scaleups and Fortune 100 enterprises. I live in South Devon, England with my wife Hannah, our two daughters Margot and Alma and our dog, Woody.Copyleft Licenseshttps://tomhipwell.co/blog/license_checking/Sun, 16 Apr 2023 11:10:56 +0000https://tomhipwell.co/blog/license_checking/License Checking I’ve been through a couple of due diligence processes now (sales, funding rounds), and one question that always gets asked is whether we have any dependencies with a copyleft (viral) license in our codebases. This includes the full dependency tree for every dependency so it can be a bit of a scramble working this out (and fixing it) across many repos in a short space of time. Having been bitten by this a couple of times I’ve now learnt that it’s a good practice to get a basic license checker wired into your CI for each repo nice and early - stopping it from becoming a problem in the first place as you can fail the release if the license is viral.1:1 Templatehttps://tomhipwell.co/templates/1to1/Mon, 27 Mar 2023 13:25:50 +0000https://tomhipwell.co/templates/1to1/PreviewMarkdown XX <> TH Important Links Kick Off 1:1 Brag Document ⏫ Growth Tracker Growth We should be catching up on your growth every four weeks, our last growth chat was: Never! First one planned for xx xx Minutes This is your agenda, so please maintain it on a week to week basis! DD MM 202X Pulse check: what’s on your mind? Kick Off 1:1 Brag Document What are 1:1s for?Technical Design Documenthttps://tomhipwell.co/templates/tdd/Fri, 10 Mar 2023 10:45:00 +0000https://tomhipwell.co/templates/tdd/PreviewMarkdown TDD Template How to use this document (remove once read) Use it as a checklist to make sure you cover all bases, remove anything you don’t need. Work fast and light. A good TDD is <1000 words and it includes one diagram (that’s two pages, not including the 525 words for the template). Write it in the open (no private documents!). Once it’s in Ready state, share it with everyone.Decision Documenthttps://tomhipwell.co/templates/decision/Wed, 08 Mar 2023 13:37:52 +0000https://tomhipwell.co/templates/decision/PreviewMarkdown Decision Template Maximum 1500 words. Provide links to supporting documentation inline. Fill in the RAPID header block. Write the Problem statement [Optional] draft the Context block Define the Principles we must follow in making this decision Define the Dimensions we will measure the options against Develop the Options Ensure the Context block has all necessary background information Write the Recommendation Delete these instructions Date Recommend Agree Perform Input Decide Status DRAFT Recommendation / Decision Start with the point.VS Codehttps://tomhipwell.co/decks/vscode/Fri, 10 Feb 2023 15:30:00 +0000https://tomhipwell.co/decks/vscode/I’ve done a lot of technical hiring and a part of that process has always been a hands on paired programming exercise. For me, one of the traits of the better candidates is deep familiarity with their IDE or editor - they’ve spent time learning about their tool of choice and had a strong taste for the how and why of it’s configuration. Typically the candidates who checked, adjusted or discussed the configuration would finish the exercise much more quickly and comfortably, so it was normally a leading indicator early in the interview that things were going to go well.Company Strategyhttps://tomhipwell.co/templates/strategy/Mon, 05 Dec 2022 09:15:00 +0000https://tomhipwell.co/templates/strategy/PreviewMarkdown Strategy Template What is strategy? Strategy is not a roadmap, it is a set of powerful choices that combine to position the company to win over the next n (e.g. 3-5) years. There are 5 key choices in Strategy. What is our winning aspiration? Where will we play? How will we win where we have chosen to play? What capabilities must be in place to win? What management systems are required to ensure the capabilities are in place?Superforecastinghttps://tomhipwell.co/decks/superforecasting/Thu, 01 Dec 2022 09:45:00 +0000https://tomhipwell.co/decks/superforecasting/In work (and life) we’re making (and receiving) predictions all the time - from when that all important project is going to ship to how a core business metric is going to move over the next few weeks via how long it might take to hire that critical role. Working to refine and understand this skill felt important, and Philip Tetlock’s book sets out a few core behaviours and techniques used by the very best.Multipliershttps://tomhipwell.co/decks/multipliers/Fri, 18 Nov 2022 09:00:00 +0000https://tomhipwell.co/decks/multipliers/Multipliers are leaders who amplify the intelligence and capabilities of their teams. This book is another resource I’ve reached for as I’ve transitioned into being an engineering leader. It explores techniques for unlocking the full potential of team members, fostering a culture of collaboration, and empowering individuals to contribute their best work. This can impact individual growth, innovation and problem solving ability. The deck summarises the key lessons from the book, so the techniques (e.Flowhttps://tomhipwell.co/decks/flow/Sat, 05 Nov 2022 15:30:00 +0000https://tomhipwell.co/decks/flow/Mihaly Csikszentmihalyi’s book “Flow” explores the concept of flow, a state of optimal human experience where individuals are fully immersed and highly focused on an activity. Understanding flow, the structure of flow like experience and how to plan for and cultivate flow in your life and work can increase how much you enjoy activities like programming that are very conducive to flow experiences. We can use our understanding of flow to achieve increased focus, productivity and creativity in our work.Regexhttps://tomhipwell.co/decks/regex/Thu, 20 Oct 2022 14:45:00 +0000https://tomhipwell.co/decks/regex/Early in my career watching other engineers pattern match across a codebase or some logs using regex felt like some kind of inaccessible, dark magic - they could find (and replace) what they were looking ten times faster than I could. If you’ve not spent time learning the ins and outs of regular expressions and committing them to memory then this is an easy way of boosting your productivity - they pop up everywhere you need a string operation and in every context.Bashhttps://tomhipwell.co/decks/bash/Sat, 15 Oct 2022 18:30:00 +0000https://tomhipwell.co/decks/bash/Getting some ’nix super powers is part of the secret sauce that most high performing engineers have, and this deck can help bootstrap that process, fill in gaps or retain muscle memory if you’re spending less time in the terminal than you used to. The majority of the cards are built based on notes from the book The Linux Command Line which is available for free under a creative commons license and well worth a read.Oh-My-Zshhttps://tomhipwell.co/decks/ohmyzsh/Sat, 15 Oct 2022 18:30:00 +0000https://tomhipwell.co/decks/ohmyzsh/Oh-my-zsh is another little bit of engineering secret sauce, though if you’re fresh to using the framework I’d probably carefully review alternatives and find something that closely fits your workflow before diving in. The deck summarises a bunch of handy aliases for different combinations of git, yarn, kubectl and python commands. Mileage might vary (I’d suggest building your own deck instead of using this one) but I include it here as an example as I’ve found that learning the aliases nudged me towards better practice as it changed my defaults and this in turn improved the quality of my work and my productivity.Getting Things Donehttps://tomhipwell.co/decks/gtd/Fri, 14 Oct 2022 10:54:39 +0000https://tomhipwell.co/decks/gtd/Getting things done is so ubiquitous that it probably needs no introduction, but as my career developed and my home life grew more complex I needed to improve how I managed my time and that meant leaving behind some bad habits. This book and this deck helped with that transformation and while I’m now quite a few years down the line with that process I still keep working through these cards to keep the core principles of GTD front of mind, so I can use it in my own life and to coach others.Security Reviewhttps://tomhipwell.co/templates/security_review/Thu, 13 Oct 2022 16:59:37 +0000https://tomhipwell.co/templates/security_review/PreviewMarkdown Security Review Checklist (TEMPLATE) Completed By: @yourself Date: **@Today** You should be able to find out the majority of the answers to these questions by browsing the vendor’s website. If you’re struggling then it may be worth scheduling a call with a sales rep and seeing if you can answer the questions. If you’re still having issues or this is a major account and you think it needs some extra scrutiny, then reach out to <who’s in charge here?Product Requirements Documenthttps://tomhipwell.co/templates/prd/Tue, 27 Sep 2022 14:20:00 +0000https://tomhipwell.co/templates/prd/PreviewMarkdown PRD: [PRD TEMPLATE] How to use this document Use it as a checklist to make sure you cover all bases, remove anything you don’t need. Work fast and light. A good PRD is <1000 words and it includes diagrams (that’s two pages, not including the 525 words for the template). Write it in the open (no private documents!). Once it’s in Ready state, share it with everyone via Slack.Githttps://tomhipwell.co/decks/git/Sun, 28 Aug 2022 12:00:00 +0000https://tomhipwell.co/decks/git/Helps with memorising git commands and flags to get you going a bit faster in the terminal. Knowing some of the ins and outs of git is good for getting into a flow state quicker when working (what I mean here is that grepping back through your bash history or googling for that command or flag you need doesn’t interrupt you, you can remain focussed on writing the code). Covers some deeper bits of git as well (i.Kick Off 1:1https://tomhipwell.co/templates/kickoff/Fri, 26 Aug 2022 13:02:28 +0000https://tomhipwell.co/templates/kickoff/PreviewMarkdown Kick Off 1:1 This document is intended to be a first step together where we contract to understand what you need to get the most out of our 1:1 conversations. If you want to read more, Lara Hogan (who came up with the idea) has a good blog post on why Kick Off 1:1s are important. Contract The contract helps us to make sure we’re all on the same page.Brag Documenthttps://tomhipwell.co/templates/brag/Sat, 20 Aug 2022 13:13:38 +0000https://tomhipwell.co/templates/brag/PreviewMarkdown Brag Document For a bit of background, Julia Evans has a nice write up on brag documents here. The idea is that this is a quick space for you to jot down your achievements on a week to week basis. We can work through the list together to find themes, work out the big picture of what you’re working on and celebrate your accomplishments. When we come to do a performance review, we don’t need to rely on our probably fuzzy memories - we’ll have a written record of your achievements.Peakhttps://tomhipwell.co/decks/peak/Sat, 18 Jun 2022 10:55:10 +0000https://tomhipwell.co/decks/peak/Anders Ericsson’s masterpiece on how to acquire expertise through deliberate practice changed the way I approached learning - for example, by making my approach far more systematic and methodical. This deck summarises the core lessons of the book.How to take smart noteshttps://tomhipwell.co/reading/how_to_take_smart_notes/Thu, 26 May 2022 10:55:27 +0000https://tomhipwell.co/reading/how_to_take_smart_notes/Sönke Ahrens | 2017 | Amazon | Goodreads Why take notes? And why be structured and disciplined about how it is done? Academic success is not correlated to IQ north of 120, so what’s the secret sauce? There is no measurable correlation between a high IQ and academic success – at least not north of 120 This tallies with studies of Nobel prize winners, where the IQ range is given as >=120.