Skip to content

Instantly share code, notes, and snippets.

View aravindkarnam's full-sized avatar
🛠️
Building Crawl4AI

Aravind aravindkarnam

🛠️
Building Crawl4AI
View GitHub Profile
@aravindkarnam
aravindkarnam / FunctionBotRun.csx
Created July 13, 2018 21:17 — forked from ahelland/FunctionBotRun.csx
An Azure Function implementing the Microsoft Bot Framework to return random quotes
#r "Newtonsoft.Json"
#r "Microsoft.Rest.ClientRuntime.dll"
#r "Microsoft.Bot.Connector.Utilities.dll"
#r "Microsoft.Bot.Connector.dll"
using System.Net;
using Microsoft.Bot.Connector;
public static async Task<HttpResponseMessage> Run(HttpRequestMessage req, TraceWriter log)
{
log.Verbose($"C# HTTP trigger function processed a request. RequestUri={req.RequestUri}");
@aravindkarnam
aravindkarnam / main.py
Created July 27, 2018 22:37 — forked from khanhnamle1994/main.py
FCN - Full Code
#--------------------------
# USER-SPECIFIED DATA
#--------------------------
# Tune these parameters
num_classes = 2
image_shape = (160, 576)
EPOCHS = 40
BATCH_SIZE = 16
@aravindkarnam
aravindkarnam / .py
Created February 12, 2025 07:58
Scrape google playstore reviews with Crawl4AI
import asyncio
from crawl4ai.extraction_strategy import JsonCssExtractionStrategy
from crawl4ai import AsyncWebCrawler, CacheMode, CrawlerRunConfig, BrowserConfig
import json
import os
from playwright.async_api import Page, BrowserContext
async def before_retrieve_html(page: Page, context: BrowserContext, **kwargs):
# Called before final HTML retrieval.
print("[HOOK] before_retrieve_html - We can do final actions")
@aravindkarnam
aravindkarnam / .py
Last active March 7, 2025 10:05
Get crypto currency directory from coinbase with crawl4ai
import asyncio
import json
from crawl4ai import (
AsyncWebCrawler,
CacheMode,
CrawlerRunConfig,
LLMConfig,
JsonCssExtractionStrategy,
)
import time
@aravindkarnam
aravindkarnam / McD.py
Last active March 30, 2025 03:57
Extract Mcdonald's swiggy menu using Crawl4AI
import asyncio
import json
from pprint import pprint
from crawl4ai import (
AsyncWebCrawler,
CrawlerRunConfig,
BrowserConfig,
JsonCssExtractionStrategy,
LLMConfig,
)
@aravindkarnam
aravindkarnam / walmart.py
Last active March 26, 2025 07:21
Crawl walmart site with Crawl4AI
from crawl4ai import (
CrawlerRunConfig,
BrowserConfig,
AsyncWebCrawler,
JsonCssExtractionStrategy,
LLMConfig,
)
from crawl4ai.utils import preprocess_html_for_schema
import asyncio
@aravindkarnam
aravindkarnam / linkedin.py
Created March 26, 2025 07:20
Crawl linkedin pages with crawl4ai
from crawl4ai import (
CrawlerRunConfig,
BrowserConfig,
AsyncWebCrawler,
DefaultMarkdownGenerator,
BM25ContentFilter,
)
import asyncio
from pprint import pprint