Hướng dẫn nhanh

This guide covers every Firecrawl feature:
- Bắt đầu — Create a firecrawl account and set your api key
- How to Use Scrape — Pull llm ready data from single web pages
- How to Use Crawl — Crawl entire websites and follow every link
- Cách sử dụng chức năng tìm kiếm — Search the web for real time web data
- How to Use Map — Run url discovery across individual pages
- How to Use Extract — Extract structured data with natural language prompts
Thời gian cần thiết: 5 phút cho mỗi phim
Cũng trong hướng dẫn này: Mẹo chuyên nghiệp | Những lỗi thường gặp | Khắc phục sự cố | Chạy | Các lựa chọn thay thế
Tại sao nên tin tưởng hướng dẫn này?
I have used Firecrawl for over a year and tested every feature covered here.
This tutorial on how to use Firecrawl comes from real hands-on web scraping work.

Firecrawl is an ai powered web crawler that turns websites into clean, llm ready data.
But most users only scratch the surface of what this versatile tool can do.
This guide shows you how to use every major feature, step by step, with screenshots.
Firecrawl Tutorial
This complete Firecrawl tutorial walks you through every feature, from your first api key to advanced batch operations that make you a power user.

Bò trườn trên lửa
Turn any website into clean, structured data with a single api. Firecrawl handles javascript rendering, rate limits, and url discovery so you get llm ready data fast. Start on the free plan — no credit card needed.
Getting Started with Firecrawl
Trước khi sử dụng bất kỳ tính năng nào, hãy hoàn tất thiết lập một lần này.
Quá trình này mất khoảng 3 phút.
Hãy xem đoạn video tổng quan ngắn này trước nhé:
Bây giờ chúng ta hãy cùng xem xét từng bước một.
Bước 1: Tạo tài khoản của bạn
Go to the Firecrawl website at https://www.firecrawl.dev.
Click “Sign Up” to open a firecrawl account on the free plan.
Nhập email của bạn và tạo mật khẩu.
✓ Điểm kiểm tra: Kiểm tra của bạn hộp thư đến để nhận email xác nhận.
Step 2: Install and Set Your API Key
Run pip install firecrawl-py to add the api service to your project.
Copy your api key, then save it as an environment variable for bảo vệ.
Đây là giao diện của bảng điều khiển:

✓ Điểm kiểm tra: You should see the main dashboard with your usage and api key.
Step 3: Make Your First Call
Import the FirecrawlApp class and create a firecrawl instance with your key.
Every request hits the base endpoint at https://api.firecrawl.dev/v2.
✅ Hoàn thành: You’re ready to scrape, crawl, search, map, and extract data.
How to Use Firecrawl Scraper
Scrape lets you pull clean, llm ready data from a single page with one single api call.
Scrape mode targets individual pages and is ideal for extracting specific details fast.
Dưới đây là hướng dẫn sử dụng từng bước.
Bây giờ chúng ta hãy cùng phân tích từng bước.
Bước 1: Lấy mã API của bạn
Create a firecrawl account, then copy your api key from the dashboard.
Store it as an environment variable so your api key stays out of your raw code.
Step 2: Call the Scrape Endpoint
Pass a single url to the .scrape_url() method on your firecrawl instance.
You can request markdown format, raw html, or structured json as the output.
Đây là hình ảnh minh họa:

✓ Điểm kiểm tra: You should see clean markdown for the current page, ready for ai applications.
Step 3: Read the Clean Output
Firecrawl returns clean data with readable chữ, stripped of junk html tags.
✅ Kết quả: You turned one of the web pages into structured web data with a single api.
💡 Mẹo hay: Set the max_age parameter to cache results and skip re-scraping a page that has not changed.
How to Use Firecrawl Crawler
Crawl lets you crawl entire websites and follow links automatically without a sitemap.
Crawl mode lets you crawl websites across multiple pages and collect every reachable page.
Dưới đây là hướng dẫn sử dụng từng bước.
Bây giờ chúng ta hãy cùng phân tích từng bước.
Step 1: Point at a Root URL
Give the crawl endpoint a starting url such as https://example.com.
Firecrawl handles web crawling through links and follows pages on its own.
Step 2: Start the Crawl Job
Launch the crawl job and grab the returned job id to track progress.
Use the job id to poll status and pull crawled data as each page finishes.
Đây là hình ảnh minh họa:

✓ Điểm kiểm tra: You should see scraped pages from across the site building into web data.
Bước 3: Xuất kết quả
Collect every page as markdown or structured json for your data pipelines.
✅ Kết quả: You captured an entire site, even complex websites with dynamic content.
💡 Mẹo hay: Firecrawl uses batch processing and concurrent browsers, so large crawl operations stay fast.
How to Use Firecrawl Search Engine
Tìm kiếm lets you search the web and return clean results you can feed straight into ai agents.
Search lets you query the web and pull back real time web data as readable text.
Dưới đây là hướng dẫn sử dụng từng bước.
Bây giờ chúng ta hãy cùng phân tích từng bước.
Step 1: Send a Query
Call the search endpoint with a plain natural language query.
Firecrawl can search the web across news websites, job boards, and review sites.
Bước 2: Chọn định dạng đầu ra
Ask for markdown format or structured json for each result.
Every result arrives as clean data, not noisy raw html.
Đây là hình ảnh minh họa:

✓ Điểm kiểm tra: You should see ranked web pages returned as structured web data.
Step 3: Pipe Into Your App
Feed the structured data into trợ lý AI or ai workflows.
✅ Kết quả: You added live web content to your ai applications in a single api call.
💡 Mẹo hay: Pair search with extract to gather news articles for market research or competitive intelligence.
How to Use Firecrawl Advanced Map
Map lets you run url discovery and turn one domain into a full map of individual pages.
Map mode retrieves every url on a site quickly for fast url discovery.
Dưới đây là hướng dẫn sử dụng từng bước.
Bây giờ chúng ta hãy cùng phân tích từng bước.
Step 1: Submit the Domain
Pass a single root url to the map endpoint.
Firecrawl returns every link without needing a sitemap.
Step 2: Review the URL List
Scan the returned individual pages before you scrape them.
Save the list to a csv file to plan your data extraction.
Đây là hình ảnh minh họa:

✓ Điểm kiểm tra: You should see a full sitemap of web pages from one single url.
Step 3: Select What to Scrape
Pick the pages you want, then send them to scrape or crawl.
✅ Kết quả: You mapped the whole site, ready for targeted web scraping.
💡 Mẹo hay: Map first on dynamic websites and web apps to avoid wasting credits on pages you do not need.
How to Use Firecrawl Extractor
Chiết xuất lets you extract structured data from any page using natural language prompts.
Extract uses ai powered parsing to pull exactly the fields you describe.
Dưới đây là hướng dẫn sử dụng từng bước.
Bây giờ chúng ta hãy cùng phân tích từng bước.
Step 1: Define a Schema
Write a Pydantic model with: from pydantic import BaseModel.
Schema based extraction tells Firecrawl which fields to return as structured json.
Step 2: Describe the Data
Add natural language prompts for fields a schema cannot capture.
This is how you extract data from web pages without writing parsing rules.
Đây là hình ảnh minh họa:

✓ Điểm kiểm tra: You should see tidy structured data instead of messy html tags.
Step 3: Receive Structured JSON
Firecrawl returns clean structured json ready for training data.
✅ Kết quả: You replaced fragile scrapers with one extract call and less data cleaning.
💡 Mẹo hay: Use only_main_content, include_tags, and exclude_tags to keep the extracted web data focused.
Firecrawl Pro Tips and Shortcuts
After testing Firecrawl for over a year, here are my best tips for cleaner data extraction.
Phím tắt
| Hoạt động | Phím tắt |
|---|---|
| Run scrape in playground | Ctrl + Enter |
| Copy api key | Ctrl + C |
| Open docs | Ctrl + K |
| Switch output format | Tab |
Những tính năng ẩn mà hầu hết mọi người bỏ lỡ
- max_age caching: Reuse recent crawled data for faster, cheaper repeat scraping.
- Browser actions: Click, scroll, and type to reach content behind javascript rendering.
- Async batching: Process thousands of urls with batch processing without blocking your web apps.
Firecrawl Common Mistakes to Avoid
Mistake #1: Hardcoding your API key
❌ Sai: Pasting your api key directly into shared code or a public repo.
✅ Bên phải: Load the key from an environment variable so it stays private.
Mistake #2: Crawling before mapping
❌ Sai: Crawling entire websites blindly and burning credits on junk pages.
✅ Bên phải: Run url discovery with map first, then crawl only the pages you need.
Mistake #3: Parsing raw html yourself
❌ Sai: Writing brittle rules to clean raw html and strip html tags by hand.
✅ Bên phải: Use schema based extraction to get clean structured json directly.
Firecrawl Troubleshooting
Problem: 401 unauthorized errors
Gây ra: Your api key is missing or not loaded from the environment variable.
Sửa chữa: Re-export the key, then recreate your firecrawl instance and retry.
Problem: Timeout errors on complex websites
Gây ra: Heavy javascript rendering or dynamic content takes longer to load.
Sửa chữa: Add a wait action so the current page finishes loading before capture.
Problem: Hitting rate limits
Gây ra: Too many requests at once on the free plan or a lower tier.
Sửa chữa: Slow batch operations or upgrade for more concurrent browsers.
📌 Ghi chú: If none of these fix your issue, contact Firecrawl support.
Firecrawl là gì?
Bò trườn trên lửa is an ai powered web scraping tool that turns websites into clean, structured data.
Think of it as a web crawler that hands ai agents readable text instead of messy raw html.
Firecrawl was developed by Mendable.ai to reduce token waste for ai applications.
Hãy xem đoạn video tổng quan ngắn này:
Nó bao gồm các tính năng chính sau:
- Thu thập dữ liệu: Extract data from single web pages as markdown format or structured json.
- Bò trườn: Crawl entire websites and follow links without a sitemap.
- Tìm kiếm: Search the web and return clean data from news websites and review sites.
- Bản đồ: Fast url discovery that turns one domain into a full sitemap.
- Extract: Pull structured web data using natural language prompts and Pydantic schemas.
Firecrawl beats traditional web scraping by handling proxies, rate limits, and javascript rendering for you.
Unlike traditional tools that need Selenium, it serves data scientists and developers through one api service.
Để xem đánh giá đầy đủ, hãy xem bài viết của chúng tôi. Firecrawl review.

Bảng giá Firecrawl
Here’s what Firecrawl costs in 2026:
| Kế hoạch | Giá | Tốt nhất cho |
|---|---|---|
| Miễn phí | Miễn phí | Testing scrape and crawl on a few pages |
| Sở thích | 16 đô la/tháng | Solo developers and small data pipelines |
| Tiêu chuẩn | 83 đô la/tháng | Teams running regular web crawling jobs |
| Sự phát triển | 333 đô la/tháng | High-volume market research and monitoring |
Dùng thử miễn phí: Yes — the free plan lets you scrape a limited number of pages.
Đảm bảo hoàn tiền: No formal guarantee, but you can downgrade anytime.

💰 Giá trị tốt nhất: Standard — best balance of credits and concurrent browsers for most teams.
So sánh Firecrawl và các giải pháp thay thế
How does Firecrawl compare? Here’s the competitive landscape:
| Dụng cụ | Tốt nhất cho | Giá | Xếp hạng |
|---|---|---|---|
| Bò trườn trên lửa | AI-ready clean data | 16 đô la/tháng | ⭐ 3.5 |
| Ứng dụng | Prebuilt actors | 49 đô la/tháng | ⭐ 4.5 |
| Dữ liệu sáng | Proxy scale | $ Custom | ⭐ 4.6 |
| Crawl4AI | Mã nguồn mở | Miễn phí | ⭐ 4.4 |
| Scrapy | Python control | Miễn phí | ⭐ 4.5 |
| ScrapeGraphAI | LLM graph scraping | 20 đô la/tháng | ⭐ 4.3 |
Lựa chọn nhanh:
- Tốt nhất tổng thể: Firecrawl — cleanest llm ready data from a single api.
- Ngân sách tốt nhất: Crawl4AI — free and open source for hands-on data scientists.
- Phù hợp nhất cho người mới bắt đầu: Firecrawl — natural language prompts hide the hard parts.
- Best for proxy scale: Bright Data — huge proxy pool for complex websites.
🎯 Firecrawl Alternatives
Bạn đang tìm kiếm các lựa chọn thay thế cho Firecrawl? Dưới đây là những lựa chọn hàng đầu:
- 🚀 Apify: Marketplace of prebuilt actors for web scraping, good when you want ready-made scrapers over a single api.
- 🏢 Bright Data: Enterprise proxy network for huge crawl jobs and content monitoring at scale across dynamic websites.
- 💰 Crawl4AI: Free, open source crawler that outputs llm ready data, ideal for budget ai workflows and local runs.
- 🔧 Scrapy: Battle-tested Python framework giving developers full control over crawling, parsing, and data pipelines.
- 🧠 ScrapeGraphAI: Graph-based, ai powered extraction that maps page structure for schema based extraction of structured data.
Để xem danh sách đầy đủ, vui lòng xem trang của chúng tôi. Firecrawl alternatives hướng dẫn.
⚔️ So sánh Firecrawl
Dưới đây là cách Firecrawl so sánh với từng đối thủ cạnh tranh:
- So sánh Firecrawl và Apify: Firecrawl wins on clean, llm ready output; Apify wins on its library of prebuilt scrapers.
- So sánh Firecrawl và Bright Data: Bright Data wins on proxy scale; Firecrawl wins on simpler structured data extraction.
- Firecrawl đấu với Crawl4AI: Crawl4AI wins on price and self-hosting; Firecrawl wins on managed rate limits and reliability.
- Firecrawl đấu với Scrapy: Scrapy wins on low-level control; Firecrawl wins on speed and zero proxy setup.
- So sánh Firecrawl và ScrapeGraphAI: Both are ai powered; Firecrawl wins on crawl coverage, ScrapeGraphAI on graph logic.
Start Using Firecrawl Now
You learned how to use every major Firecrawl feature:
- ✅ Scrape
- ✅ Crawl
- ✅ Tìm kiếm
- ✅ Map
- ✅ Extract
Bước tiếp theo: Hãy chọn một tính năng và thử ngay bây giờ.
Most people start with Scrape.
Chỉ mất chưa đến 5 phút.
Câu hỏi thường gặp
Firecrawl được dùng để làm gì?
Firecrawl is an ai powered web scraping tool. It turns web pages into clean, llm ready data for ai agents, rag systems, market research, and price monitoring.
Bạn có thể sử dụng Firecrawl miễn phí không?
Yes. The free plan lets you scrape a limited number of pages with your api key, so you can test scrape, crawl, and extract before paying.
How do I install Firecrawl?
Run pip install firecrawl-py, then set your api key as an environment variable and create a firecrawl instance to start your first single api call.
How is Firecrawl different from traditional scraping?
Traditional web scraping needs Selenium and manual proxy setup. Firecrawl handles javascript rendering, rate limits, and clean data automatically through one api service.
Can Firecrawl extract structured data?
Yes. Use natural language prompts or a Pydantic schema for schema based extraction, and Firecrawl returns structured json instead of raw html.













