Veo 3 shows emergent zero-shot abilities across many visual tasks, indicating that video models are on a path to becoming vision foundation models—just like LLMs became foundation models for language.
> The remarkable zero-shot capabilities of Large Language Models (LLMs) have propelled natural language processing from task-specific models to unified, generalist foundation models. This transformation emerged from simple primitives: large, generative models trained on web-scale data. Curiously, the same primitives apply to today's generative video models. Could video models be on a trajectory towards general-purpose vision understanding, much like LLMs developed general-purpose language understanding?