Implementing OCR in PHP

2026-02-14 19:39:47 PHP

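Ways to Implement OCR in PHP

PHP can perform text recognition by integrating a third-party OCR library or API. Several common approaches follow:

Using Tesseract OCR

Tesseract is an open-source OCR engine that supports many languages. PHP can use it either by invoking the command-line tool or through an extension library.

Installing Tesseract: on Debian- or Ubuntu-based Linux systems, Tesseract can be installed with the following command: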

sudo apt-get install tesseract-ocr

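Sample code for calling Tesseract from PHP:

$imagePath  = 'path/to/image.png';
// Tesseract appends ".txt" to the output base name itself
$outputBase = 'path/to/output';

// Run Tesseract; escapeshellarg() protects against spaces and shell metacharacters in paths
exec('tesseract ' . escapeshellarg($imagePath) . ' ' . escapeshellarg($outputBase), $log, $exitCode);
if ($exitCode !== 0) {
    exit("Tesseract failed with exit code $exitCode" . PHP_EOL);
}

// Read the recognized text
$text = file_get_contents($outputBase . '.txt');
echo $text;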
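
Because Tesseract supports multiple languages, the call can also be wrapped in a small helper that selects the language. The following is a minimal sketch rather than a definitive implementation. It assumes the tesseract binary is on the PATH, uses the `-l` option to pick the language, and passes the special output name `stdout` so that no intermediate file is written. The function names `buildTesseractCommand` and `recognizeText` are hypothetical.

```php
<?php
declare(strict_types=1);

// Hypothetical helper: builds the shell command without running it.
function buildTesseractCommand(string $imagePath, string $lang = 'eng'): string
{
    // "stdout" makes Tesseract print the text instead of writing a .txt file
    return 'tesseract ' . escapeshellarg($imagePath) . ' stdout -l ' . escapeshellarg($lang);
}

// Hypothetical helper: runs the command and returns the recognized text.
function recognizeText(string $imagePath, string $lang = 'eng'): string
{
    exec(buildTesseractCommand($imagePath, $lang), $lines, $exitCode);
    if ($exitCode !== 0) {
        throw new RuntimeException("Tesseract exited with code $exitCode");
    }
    return implode(PHP_EOL, $lines);
}
```

A call such as recognizeText('path/to/image.png', 'chi_sim') would then return Simplified Chinese text, provided the matching language data is installed.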

Using the Google Cloud Vision API

The Google Cloud Vision API provides OCR as a hosted service and recognizes text in many languages.

Install the Google Cloud Vision client library for PHP with Composer:

composer require google/cloud-vision

Sample code for calling the Google Cloud Vision API from PHP:

require 'vendor/autoload.php';

use Google\Cloud\Vision\VisionClient;

$vision = new VisionClient([
    'keyFilePath' => 'path/to/service-account-key.json'
]);

$image = $vision->image(file_get_contents('path/to/image.png'), ['TEXT_DETECTION']);
$result = $vision->annotate($image);

// text() returns null when no text is detected, so fall back to an empty array
foreach (($result->text() ?? []) as $text) {
    echo $text->description() . PHP_EOL;
}
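
The same detection is exposed by the Vision REST API through its `images:annotate` method. To illustrate what such a request and its response look like, here is a minimal sketch that builds the JSON body for a `TEXT_DETECTION` request and pulls the full text out of a decoded response. The helper names `buildVisionRequest` and `extractFullText` are hypothetical, and the sample response is hand-written for illustration, not real API output. Sending the request and authenticating are deliberately left out.

```php
<?php
declare(strict_types=1);

// Hypothetical helper: JSON body for POST https://vision.googleapis.com/v1/images:annotate
function buildVisionRequest(string $imageBytes): string
{
    return json_encode([
        'requests' => [[
            'image' => ['content' => base64_encode($imageBytes)],
            'features' => [['type' => 'TEXT_DETECTION']],
        ]],
    ], JSON_THROW_ON_ERROR);
}

// Hypothetical helper: the first text annotation holds the whole detected text.
function extractFullText(array $decodedResponse): string
{
    return $decodedResponse['responses'][0]['textAnnotations'][0]['description'] ?? '';
}

// Hand-written sample shaped like an images:annotate response (not real output)
$sample = json_decode('{"responses":[{"textAnnotations":[
    {"locale":"en","description":"Hello\nWorld"},
    {"description":"Hello"},{"description":"World"}]}]}', true);

echo extractFullText($sample), PHP_EOL;
```

Keeping request building and response parsing in separate functions makes both easy to check without network access.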

Using the Microsoft Azure Computer Vision API

The Azure Computer Vision API also provides OCR and recognizes text in many languages.

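No Composer package is needed here: the microsoft/azure-storage package targets Azure Storage and has no OCR methods. The Computer Vision OCR operation can instead be called over REST with PHP's cURL extension.

Sample code for calling the Azure Computer Vision API from PHP:

// Placeholders: take the endpoint and key from your Computer Vision resource in the Azure portal
$endpoint = 'https://your-resource.cognitiveservices.azure.com';
$subscriptionKey = 'yourKey';

$content = file_get_contents('path/to/image.png');

// POST the raw image bytes to the OCR operation (REST API v3.2)
$ch = curl_init($endpoint . '/vision/v3.2/ocr?language=en');
curl_setopt_array($ch, [
    CURLOPT_POST => true,
    CURLOPT_POSTFIELDS => $content,
    CURLOPT_RETURNTRANSFER => true,
    CURLOPT_HTTPHEADER => [
        'Content-Type: application/octet-stream',
        'Ocp-Apim-Subscription-Key: ' . $subscriptionKey,
    ],
]);
$response = curl_exec($ch);
if ($response === false) {
    exit('Request to Azure failed' . PHP_EOL);
}

$result = json_decode($response);

// The response nests words inside lines inside regions
foreach (($result->regions ?? []) as $region) {
    foreach ($region->lines as $line) {
        foreach ($line->words as $word) {
            echo $word->text . ' ';
        }
        echo PHP_EOL;
    }
}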
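
The nested loops above follow the shape of the OCR result: regions contain lines, and lines contain words. A minimal sketch of collecting those words into one string per line is shown below. It assumes the result has been decoded into associative arrays with json_decode($json, true). The function name `ocrResultToLines` is hypothetical, and the sample payload is hand-written for illustration, not real API output.

```php
<?php
declare(strict_types=1);

// Hypothetical helper: flattens regions -> lines -> words into one string per line.
function ocrResultToLines(array $result): array
{
    $lines = [];
    foreach ($result['regions'] ?? [] as $region) {
        foreach ($region['lines'] ?? [] as $line) {
            // Join the words of a line with single spaces
            $lines[] = implode(' ', array_column($line['words'] ?? [], 'text'));
        }
    }
    return $lines;
}

// Hand-written sample shaped like an OCR result (not real API output)
$sample = json_decode('{"language":"en","regions":[{"lines":[
    {"words":[{"text":"Hello"},{"text":"world"}]},
    {"words":[{"text":"PHP"},{"text":"OCR"}]}]}]}', true);

print_r(ocrResultToLines($sample));
```

Returning an array of lines instead of echoing directly leaves the caller free to store, search, or post-process the text.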