Druce.ai
  • Sep 15, 2023 datascience 

    Truth, Lies, and ChatGPT

    There are three kinds of lies: lies, damned lies, and statistics. - Mark Twain

  • Sep 14, 2023 datascience 

    Bullshit

    That was just bullshit, Joel. - Miles, in Risky Business (1983)

    History is a set of lies agreed upon. - Napoleon Bonaparte

    Bullshit is the glue that binds us as a nation. - George Carlin

  • Mar 29, 2023 datascience 

    ChatGPT, OpenAI, and the Generative AI Revolution

    I think it’s comparable in scale with the Industrial Revolution or electricity — or maybe the wheel. - Geoffrey Hinton

    Any sufficiently advanced technology is indistinguishable from magic. - Arthur C. Clarke

    GPT is a transformer so smart / That can write like a human or a bard / It can answer your queries / Or make stories so eerie / That you’ll wonder if it has a heart - GPT

  • Feb 12, 2023 datascience 

    NYC Subways and the Terrible, Horrible, No Good, Very Bad, Turnstile Data

    The future ain’t what it used to be. - Yogi Berra

    NYC Subway entries

  • Jan 22, 2023 datascience 

    Numbers With Wings: A Modern Data Stack-In-A-Box

    Not everything that counts can be counted, and not everything that can be counted counts. - Albert Einstein

    There are three kinds of people: those who can count, and those who can’t. - Source unknown

    NYC Subway entries

  • Dec 17, 2022 politics 

    Kant, Nietzsche, Elon Musk, SBF, wokeness, and the categorical imperative

    I beseech you, in the bowels of Christ, think it possible you may be mistaken. - Oliver Cromwell

  • Nov 26, 2022 datascience 

    Time Series Analysis In Theory
    • A regular time series is a function from integers to real numbers: \(y_t = f(t)\).
    • Many useful time series can be specified using linear difference equations like \(y_t = k_1y_{t-1} + k_2y_{t-2} + \dots + k_ny_{t-n}\)
    • This recurrence relation has a characteristic equation (and matrix representation), whose roots (or matrix eigenvalues) can be used to write closed-form solutions like \(y_t=ax^t\).
    • Any time series combining exponential growth/decay and sinusoidal components can be modeled by a linear difference equation or its matrix representation.
    Fig. 1. Possible regimes for a 2nd-order linear difference equation with complex eigenvalues
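
    The bullets above can be illustrated with a small sketch for the 2nd-order case. It assumes distinct characteristic roots, and the function names are made up for this example rather than taken from the post. It iterates the recurrence directly, then rebuilds the same values from the roots of the characteristic equation \(x^2 - k_1x - k_2 = 0\).

```python
import cmath


def simulate(k1, k2, y0, y1, n):
    """Iterate y_t = k1*y_{t-1} + k2*y_{t-2} and return the first n terms."""
    ys = [y0, y1]
    for _ in range(n - 2):
        ys.append(k1 * ys[-1] + k2 * ys[-2])
    return ys[:n]


def characteristic_roots(k1, k2):
    """Roots of x^2 - k1*x - k2 = 0 (possibly complex)."""
    disc = cmath.sqrt(k1 * k1 + 4 * k2)
    return (k1 + disc) / 2, (k1 - disc) / 2


def closed_form(k1, k2, y0, y1, t):
    """y_t = a*x1^t + b*x2^t, with a and b fitted to y0 and y1.

    Assumes the two roots are distinct.
    """
    x1, x2 = characteristic_roots(k1, k2)
    a = (y1 - y0 * x2) / (x1 - x2)
    b = y0 - a
    return (a * x1 ** t + b * x2 ** t).real


# Complex roots with modulus below 1 give a damped oscillation.
roots = characteristic_roots(1, -0.5)
print([abs(r) for r in roots])
print(simulate(1, -0.5, 1.0, 1.0, 8))
print([round(closed_form(1, -0.5, 1.0, 1.0, t), 6) for t in range(8)])
```

    With real roots the same code reproduces pure growth or decay; for example, `simulate(1, 1, 0, 1, 10)` yields the Fibonacci numbers.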

  • May 16, 2022 datascience 

    How I learned to stop worrying and love PCA: The optimal threshold for PCA dimensionality reduction

    PCA is an essential data science tool which uses the SVD to break down the linear relationships in data. The Gavish-Donoho optimal truncation threshold provides a simple formula to select a good threshold for dimensionality reduction.

    Fig. 1. A random 2D data set with singular vectors scaled by singular values
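
    As a rough sketch of how such a threshold can be applied: when the noise level is unknown, Gavish and Donoho give a cubic approximation for a coefficient \(\omega(\beta)\), where \(\beta\) is the aspect ratio of the data matrix, and the cutoff is \(\omega(\beta)\) times the median singular value. The code below takes singular values as given (in practice they would come from an SVD routine). The sample values and function names are hypothetical.

```python
from statistics import median


def omega(beta):
    """Cubic approximation to the Gavish-Donoho coefficient (unknown noise)."""
    return 0.56 * beta ** 3 - 0.95 * beta ** 2 + 1.82 * beta + 1.43


def optimal_threshold(singular_values, n_rows, n_cols):
    """Hard threshold tau = omega(beta) * median singular value."""
    beta = min(n_rows, n_cols) / max(n_rows, n_cols)
    return omega(beta) * median(singular_values)


def truncation_rank(singular_values, n_rows, n_cols):
    """Number of singular values that survive the threshold."""
    tau = optimal_threshold(singular_values, n_rows, n_cols)
    return sum(1 for s in singular_values if s > tau)


# Hypothetical singular values for a 100 x 8 matrix: two strong components
# standing out from a floor of noise-sized values.
sv = [50.0, 30.0, 2.1, 2.0, 1.9, 1.8, 1.7, 1.6]
print(optimal_threshold(sv, 100, 8))
print(truncation_rank(sv, 100, 8))
```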

  • Apr 16, 2022 blockchain  tech 

    Crypto systems, iron laws, and levels of resilience

    Meditating on practical open distributed computing, how to build un-take-down-able apps like Web3 but without permissionless blockchains.

  • Mar 29, 2021 datascience 

    The AI Hierarchy of Needs

    The perpetual challenge is building upper tiers before lower tiers are 100%, and strengthening lower tiers without breaking upper tiers.

  • Feb 19, 2021 investing 

    Optimal Safe Withdrawal for Retirement Using Certainty-Equivalent Spending, Revisited

    Revisiting Bengen’s “4% Rule” at various levels of risk aversion, and generalizing beyond a simple fixed-withdrawal, no-shortfall rule, to flexible rules at different levels of risk aversion.
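
    To make "certainty-equivalent spending" concrete, here is a minimal sketch that assumes CRRA (constant relative risk aversion) utility with risk-aversion parameter gamma. That utility choice, the function names, and the spending streams are assumptions for illustration. The certainty equivalent is the constant spending level that gives the same average utility as a variable stream.

```python
import math


def crra_utility(c, gamma):
    """CRRA utility of spending c; gamma = 1 is the log-utility limit."""
    if gamma == 1:
        return math.log(c)
    return (c ** (1 - gamma) - 1) / (1 - gamma)


def certainty_equivalent(spending, gamma):
    """Constant spending with the same mean utility as the given stream."""
    mean_u = sum(crra_utility(c, gamma) for c in spending) / len(spending)
    if gamma == 1:
        return math.exp(mean_u)
    return (mean_u * (1 - gamma) + 1) ** (1 / (1 - gamma))


# Two hypothetical streams with the same average spending of 4.
steady = [4.0, 4.0, 4.0, 4.0]
volatile = [2.0, 6.0, 2.0, 6.0]

for gamma in (0, 1, 2):
    print(gamma,
          certainty_equivalent(steady, gamma),
          certainty_equivalent(volatile, gamma))
```

    At gamma = 0 the two streams are valued equally. As gamma rises, the volatile stream's certainty equivalent falls below its average, which is why a flexible withdrawal rule looks less attractive to a more risk-averse retiree.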

  • Jan 14, 2021 politics 

    What I would have written if I were Jack Dorsey

    “Our decision to permanently suspend Donald Trump from the Twitter platform may be a major inflection point in Twitter’s history. As CEO, I owe our users and employees a clear statement of why we took this action and how this decision evolved, i.e. not just some pablum about what a hard and potentially dangerous decision it was.”

  • Dec 6, 2020 datascience  markets  investing 

    Demystifying Portfolio Optimization with Python and CVXOPT

    Efficient frontier

    Do you want to do fast and easy portfolio optimization with Python? Then CVXOPT, and this post, are for you! Here’s a gentle intro to portfolio theory and some code to get you started.
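
    For a feel of the underlying math before reaching for a solver, here is a two-asset sketch in plain Python. It does not use CVXOPT. It relies on the closed-form minimum-variance weights, which exist only because there are two assets and no constraint other than the weights summing to 1. The variances and covariance are hypothetical inputs.

```python
def min_variance_weights(var_a, var_b, cov_ab):
    """Weights (w_a, w_b) minimizing portfolio variance, with w_a + w_b = 1.

    No long-only constraint: a weight can come out negative (a short position).
    """
    denom = var_a + var_b - 2 * cov_ab
    w_a = (var_b - cov_ab) / denom
    return w_a, 1 - w_a


def portfolio_variance(w_a, w_b, var_a, var_b, cov_ab):
    """Variance of a two-asset portfolio."""
    return w_a ** 2 * var_a + w_b ** 2 * var_b + 2 * w_a * w_b * cov_ab


# Hypothetical inputs: asset A is less volatile than asset B.
var_a, var_b, cov_ab = 0.04, 0.09, 0.006
w_a, w_b = min_variance_weights(var_a, var_b, cov_ab)
print(w_a, w_b)
print(portfolio_variance(w_a, w_b, var_a, var_b, cov_ab))
```

    With many assets and constraints such as long-only weights or a target return, there is no such simple formula, which is where a quadratic programming solver comes in.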

  • Oct 12, 2020 datascience 

    Beyond Grid Search: Using Hyperopt, Optuna, and Ray Tune to hypercharge hyperparameter tuning for XGBoost and LightGBM

    RandomizedSearch HPO vs. Bayesian HPO

    Bayesian optimization of machine learning model hyperparameters works faster and better than grid search. Here’s how we can speed up hyperparameter tuning using 1) Bayesian optimization with Hyperopt and Optuna, running on… 2) the Ray distributed machine learning framework, with a unified API to many hyperparameter search algos and early stopping schedulers, and… 3) a distributed cluster of cloud instances for even faster tuning.
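
    The early-stopping idea can be sketched without any of those libraries. Below is a toy successive-halving loop, the family of schedulers that Ray Tune's ASHA belongs to: every candidate gets a small training budget, the worse half is dropped, and the budget doubles for the survivors. The loss function is a made-up stand-in for a real training run, and nothing here is the Hyperopt, Optuna, or Ray Tune API.

```python
import math


def toy_loss(lr, epochs):
    """Hypothetical validation loss: best near lr = 0.1, lower with more epochs."""
    return (math.log10(lr) + 1) ** 2 + 1.0 / epochs


def successive_halving(candidates, min_epochs=1, eta=2):
    """Keep the best 1/eta of the candidates each round, growing the budget."""
    survivors = list(candidates)
    epochs = min_epochs
    while len(survivors) > 1:
        ranked = sorted(survivors, key=lambda lr: toy_loss(lr, epochs))
        survivors = ranked[: max(1, len(ranked) // eta)]
        epochs *= eta
    return survivors[0]


learning_rates = [0.001, 0.003, 0.01, 0.03, 0.1, 0.3, 1.0, 3.0]
print(successive_halving(learning_rates))
```

    Most of the budget goes to the few candidates that survive early rounds, instead of being spread evenly across a grid.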

  • Aug 27, 2020 datascience 

    Deploy a Microservice to AWS Elastic Container Service: The Harder Way and the Easier Way

    A while back I made this Pizza service weekend project and I thought I could just press a button in AWS and deploy it in the cloud. It turned out to be… more complicated. With the latest version of Docker it’s getting easier. Here’s the harder (old) way and the easier (new) way. After some configuration, you can just say docker compose up and your container is deployed.

2026 © Druce Vertes | About | Archive