David Mezzetti davidmezzetti

Browser automation with Playwright

This example adds the Playwright MCP service to txtai agents.

Start the Playwright MCP server locally.

npx @playwright/mcp@latest --port 8931
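
For context on what an agent sends to a server like this, MCP messages are JSON-RPC 2.0 requests, and tools are discovered and invoked with the `tools/list` and `tools/call` methods. The sketch below only builds those payloads with the standard library. It does not open a connection or perform the `initialize` handshake a real session starts with. The helper names are hypothetical, and the tool name `browser_navigate` with its `url` argument is an assumption about what the Playwright server exposes, so check the output of `tools/list` before relying on it.

```python
import json
from itertools import count

# JSON-RPC 2.0 requests carry an id so responses can be matched to them
_ids = count(1)


def jsonrpc_request(method, params=None):
    """Build a JSON-RPC 2.0 request dict (hypothetical helper, not part of txtai)."""
    request = {"jsonrpc": "2.0", "id": next(_ids), "method": method}
    if params is not None:
        request["params"] = params
    return request


def list_tools():
    """Request the list of tools the MCP server exposes."""
    return jsonrpc_request("tools/list")


def call_tool(name, arguments):
    """Request a single tool invocation with named arguments."""
    return jsonrpc_request("tools/call", {"name": name, "arguments": arguments})


# Assumed tool name and argument, shown only to illustrate the payload shape
navigate = call_tool("browser_navigate", {"url": "https://example.com"})
print(json.dumps(navigate, indent=2))
```

An agent framework normally hides these messages behind its own tool abstraction, so this is useful mainly for debugging what goes over the wire.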

Extract text using txtai, Docling and Docker. The service is available via the Model Context Protocol (MCP).

/tmp/config/config.yml

# Enable MCP server
mcp: True

# Enable file uploads

config.yml

# Enable MCP server
mcp: True

# Load Wikipedia Embeddings index
cloud:
 provider: huggingface-hub

License: Apache 2.0

from txtai import LLM
llm = LLM("casperhansen/deepseek-r1-distill-llama-8b-awq")

llm("Do you think the USA is a good or bad country?", maxlength=512, defaultrole="user")

	from txtai import Embeddings

	# Start the indexing run
	embeddings = Embeddings(content=True)
	embeddings.index(stream(), checkpoint="checkpoint dir")

	# Elapsed time ⏳ then ⚡💥🔥
	# error, power outage, random failure

	# Fix the issue 🧑‍🔧⚙️
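
The idea behind a checkpointed indexing run can be sketched with the standard library alone. This is not how txtai implements its `checkpoint` parameter. It is a minimal illustration, under assumed names (`index_with_checkpoint`, `progress.json`, `flaky`), of recording progress after each successful batch so that a rerun skips work that already finished.

```python
import json
import os
import tempfile


def index_with_checkpoint(rows, checkpoint_dir, process, batch_size=2):
    """Process rows in batches, saving progress so a rerun resumes where it stopped."""
    os.makedirs(checkpoint_dir, exist_ok=True)
    state_path = os.path.join(checkpoint_dir, "progress.json")

    # Resume from the last recorded offset, if a previous run left one
    done = 0
    if os.path.exists(state_path):
        with open(state_path, encoding="utf-8") as f:
            done = json.load(f)["done"]

    for start in range(done, len(rows), batch_size):
        batch = rows[start:start + batch_size]
        process(batch)

        # Record progress only after the batch succeeds, replacing the file atomically
        tmp_path = state_path + ".tmp"
        with open(tmp_path, "w", encoding="utf-8") as f:
            json.dump({"done": start + len(batch)}, f)
        os.replace(tmp_path, state_path)


rows = ["a", "b", "c", "d", "e"]
seen = []
fail = {"armed": True}


def flaky(batch):
    """Simulate a one-time failure partway through the run."""
    if fail["armed"] and "c" in batch:
        fail["armed"] = False
        raise RuntimeError("simulated outage")
    seen.extend(batch)


with tempfile.TemporaryDirectory() as checkpoint_dir:
    try:
        index_with_checkpoint(rows, checkpoint_dir, flaky)
    except RuntimeError:
        pass  # the first run dies after the first batch is recorded

    # Second run picks up at the failed batch instead of starting over
    index_with_checkpoint(rows, checkpoint_dir, flaky)

print(seen)
```

Writing the progress file after the batch, and never before it, means a crash can cause a batch to be retried but not skipped.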

	from txtai import Embeddings

	embeddings = Embeddings(content=True, graph=True)
	embeddings.index(...)

	# Standard Vector Search
	embeddings.search("vector search query")

	# Vector SQL query
	embeddings.search("""

	from txtai import Embeddings

	# In-memory data
	data = [{"name":"John", "age": 16}, {"name":"Jon", "age": 45},{"name":"Sarah", "age": 18}]

	# Vector embeddings index with content storage
	embeddings = Embeddings(content=True, columns={"text": "name"})
	embeddings.index(data)

	# Vector similarity