diff --git a/Document-Processing-toc.html b/Document-Processing-toc.html
index 9a1c4ae1a7..1a0d17ae33 100644
--- a/Document-Processing-toc.html
+++ b/Document-Processing-toc.html
@@ -172,119 +172,110 @@
Data Extraction
-
-
- OCR Processor
-
- NET
+ Conversions
diff --git a/Document-Processing/Data-Extraction/NET/Assemblies-required.md b/Document-Processing/Data-Extraction/NET/Assemblies-required.md
new file mode 100644
index 0000000000..388abaea9a
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Assemblies-required.md
@@ -0,0 +1,189 @@
+---
+title: Assemblies required for Data Extraction | Syncfusion
+description: This section details the Syncfusion assemblies required to configure and run Data Extraction seamlessly in .NET projects.
+platform: document-processing
+control: DataExtraction
+documentation: UG
+keywords: Assemblies
+---
+# Assemblies required for Data Extraction
+
+## Smart Data Extractor
+
+The following assemblies need to be referenced in your application to extract data from PDF documents or images, based on the platform.
+
+
+
+
+ Platform(s)
+ Assembly
+
+
+
+
+
+ {{'WPF'| markdownify }},
+ {{'Windows Forms'| markdownify }} and {{'ASP.NET MVC'| markdownify }}
+
+
+ Syncfusion.SmartDataExtractor.Base
+ Syncfusion.Compression.Base
+ Syncfusion.ImagePreProcessor.Base
+ Syncfusion.OCRProcessor.Base
+ Syncfusion.Pdf.Base
+ Syncfusion.PdfToImageConverter.Base
+ Syncfusion.SmartFormRecognizer.Base
+ Syncfusion.SmartTableExtractor.Base
+ Syncfusion.Markdown
+
+
+
+
+ {{'.NET Core'| markdownify }}
+ and {{'.NET Platforms'| markdownify }}
+
+
+ Syncfusion.SmartDataExtractor.Portable
+ Syncfusion.Compression.Portable
+ Syncfusion.ImagePreProcessor.Portable
+ Syncfusion.OCRProcessor.Portable
+ Syncfusion.Pdf.Imaging.Portable
+ Syncfusion.Pdf.Portable
+ Syncfusion.PdfToImageConverter.Portable
+ Syncfusion.SmartFormRecognizer.Portable
+ Syncfusion.SmartTableExtractor.Portable
+ Syncfusion.Markdown
+
+
+
+
+ {{'.NET Multi-platform App UI (.NET MAUI)'| markdownify }}
+
+
+ Syncfusion.SmartDataExtractor.NET
+ Syncfusion.Compression.NET
+ Syncfusion.ImagePreProcessor.NET
+ Syncfusion.OCRProcessor.NET
+ Syncfusion.Pdf.Imaging.NET
+ Syncfusion.Pdf.NET
+ Syncfusion.PdfToImageConverter.NET
+ Syncfusion.SmartFormRecognizer.NET
+ Syncfusion.SmartTableExtractor.NET
+ Syncfusion.Markdown
+
+
+
+
+
+## Smart Table Extractor
+
+The following assemblies need to be referenced in your application to extract table data from PDF documents or images, based on the platform.
+
+
+
+
+ Platform(s)
+ Assembly
+
+
+
+
+
+ {{'WPF'| markdownify }},
+ {{'Windows Forms'| markdownify }} and {{'ASP.NET MVC'|
+ markdownify }}
+
+
+ Syncfusion.SmartTableExtractor.Base
+ Syncfusion.Compression.Base
+ Syncfusion.ImagePreProcessor.Base
+ Syncfusion.OCRProcessor.Base
+ Syncfusion.Pdf.Base
+ Syncfusion.PdfToImageConverter.Base
+ Syncfusion.Markdown
+
+
+
+
+ {{'.NET Core'| markdownify }}
+ and {{'.NET Platforms'| markdownify }}
+
+
+ Syncfusion.SmartTableExtractor.Portable
+ Syncfusion.Compression.Portable
+ Syncfusion.ImagePreProcessor.Portable
+ Syncfusion.OCRProcessor.Portable
+ Syncfusion.Pdf.Imaging.Portable
+ Syncfusion.Pdf.Portable
+ Syncfusion.PdfToImageConverter.Portable
+ Syncfusion.Markdown
+
+
+
+
+ {{'.NET Multi-platform App UI (.NET MAUI)'| markdownify }}
+
+
+ Syncfusion.SmartTableExtractor.NET
+ Syncfusion.Compression.NET
+ Syncfusion.ImagePreProcessor.NET
+ Syncfusion.OCRProcessor.NET
+ Syncfusion.Pdf.Imaging.NET
+ Syncfusion.Pdf.NET
+ Syncfusion.PdfToImageConverter.NET
+ Syncfusion.Markdown
+
+
+
+
+
+
+## Smart Form Recognizer
+
+The following assemblies need to be referenced in your application to recognize and extract form fields from PDF documents or images, based on the platform.
+
+
+
+
+ Platform(s)
+ Assembly
+
+
+
+
+
+ {{'WPF'| markdownify }},
+ {{'Windows Forms'| markdownify }} and {{'ASP.NET MVC'| markdownify }}
+
+
+ Syncfusion.SmartFormRecognizer.Base
+ Syncfusion.Compression.Base
+ Syncfusion.Pdf.Base
+ Syncfusion.PdfToImageConverter.Base
+
+
+
+
+ {{'.NET Core'| markdownify }} and {{'.NET Platforms'| markdownify }}
+
+
+ Syncfusion.SmartFormRecognizer.Portable
+ Syncfusion.Compression.Portable
+ Syncfusion.Pdf.Portable
+ Syncfusion.PdfToImageConverter.Portable
+
+
+
+
+ {{'.NET Multi-platform App UI (.NET MAUI)'| markdownify }}
+
+
+ Syncfusion.SmartFormRecognizer.NET
+ Syncfusion.Compression.NET
+ Syncfusion.Pdf.NET
+ Syncfusion.PdfToImageConverter.NET
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/Extract-Data-in-ASP-NET-Core.md b/Document-Processing/Data-Extraction/NET/Extract-Data-in-ASP-NET-Core.md
new file mode 100644
index 0000000000..f73bc79aa8
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Extract-Data-in-ASP-NET-Core.md
@@ -0,0 +1,196 @@
+---
+title: Extract data in ASP.NET Core | Syncfusion
+description: Learn how to extract data from PDF in ASP.NET Core with step‑by‑step guidance using Syncfusion .NET Core Data extraction library.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+---
+
+# Extract Data in ASP.NET Core
+
+The Syncfusion® Smart Data Extractor is a .NET library used to extract structured data and document elements from PDF and image files in ASP.NET Core applications.
+
+To quickly get started with extracting structured data from PDF and image files in ASP.NET Core using the Smart Data Extractor library, refer to this video tutorial:
+{% youtube "https://www.youtube.com/watch?v=fxijc3BgDTY" %}
+
+To include the Smart Data Extractor library in your ASP.NET Core application, please refer to the [NuGet Packages Required](https://help.syncfusion.com/document-processing/data-extraction/smart-data-extractor/net/nuget-packages-required) or [Assemblies Required](https://help.syncfusion.com/document-processing/data-extraction/smart-data-extractor/net/assemblies-required) documentation.
+
+
+## Steps to Extract Data from PDF in ASP.NET Core application
+
+{% tabcontents %}
+{% tabcontent Visual Studio %}
+
+**Prerequisites**:
+
+* Install .NET SDK: Ensure that you have the .NET SDK installed on your system. You can download it from the [.NET Downloads page](https://dotnet.microsoft.com/en-us/download).
+* Install Visual Studio: Download and install Visual Studio from the [official website](https://visualstudio.microsoft.com/downloads/).
+
+Step 1: Create a new C# ASP.NET Core Web Application project.
+ 
+
+Step 2: In configuration window, name your project and click Next.
+ 
+ 
+
+
+Step 3: Install the [Syncfusion.SmartDataExtractor.Net.Core](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Net.Core/) package as reference to your ASP.NET Core applications from [NuGet.org](https://www.nuget.org/).
+ 
+
+Step 4: A default controller named HomeController.cs is added on creation of ASP.NET Core project. Include the following namespaces in that HomeController.cs file.
+
+{% highlight c# tabtitle="C#" %}
+
+using System.Text;
+using System.Diagnostics;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+Step 5: Add a new button in the Index.cshtml as shown below.
+
+{% highlight c# tabtitle="C#" %}
+
+@{
+ Html.BeginForm("ExtractData", "Home", FormMethod.Get);
+ {
+
+
+
+ }
+ Html.EndForm();
+}
+
+{% endhighlight %}
+
+Step 6: Add a new action method named `ExtractData` in HomeController.cs and include the following code example to extract data as JSON using the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class. Then use the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the DataExtractor object to process the input and export the results in JSON format.
+
+{% highlight c# tabtitle="C#" %}
+
+// Open the input PDF file as a stream.
+using (FileStream stream = new FileStream(Path.GetFullPath("Input.pdf"), FileMode.Open, FileAccess.Read))
+{
+ // Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ // Extract form data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ // Convert JSON string into a MemoryStream for download.
+ MemoryStream outputStream = new MemoryStream(Encoding.UTF8.GetBytes(data));
+ // Reset stream position.
+ outputStream.Position = 0;
+ // Return JSON file as download in browser.
+ FileStreamResult fileStreamResult = new FileStreamResult(outputStream, "application/json");
+ fileStreamResult.FileDownloadName = "Output.json";
+ return fileStreamResult;
+}
+
+{% endhighlight %}
+
+Step 7: Build the project.
+
+Click on **Build** → **Build Solution** or press Ctrl +Shift +B to build the project.
+
+Step 8: Run the project.
+
+Click the Start button (green arrow) or press F5 to run the application.
+
+{% endtabcontent %}
+
+{% tabcontent Visual Studio Code %}
+
+**Prerequisites**:
+
+* Install .NET SDK: Ensure that you have the .NET SDK installed on your system. You can download it from the [.NET Downloads page](https://dotnet.microsoft.com/en-us/download).
+* Install Visual Studio Code: Download and install Visual Studio Code from the [official website](https://code.visualstudio.com/download).
+* Install C# Extension for VS Code: Open Visual Studio Code, go to the Extensions view (Ctrl+Shift+X), and search for 'C#'. Install the official [C# extension provided by Microsoft](https://marketplace.visualstudio.com/items?itemName=ms-dotnettools.csharp).
+
+Step 1: Open the terminal (Ctrl+` ) and run the following command to create a C# ASP.NET Core Web Application project.
+
+```
+dotnet new mvc -n ExtractDataASPNETCoreAPP
+```
+Step 2: Replace **ExtractDataASPNETCoreAPP** with your desired project name.
+
+Step 3: Navigate to the project directory using the following command
+
+```
+cd ExtractDataASPNETCoreAPP
+```
+Step 4: Use the following command in the terminal to add the [Syncfusion.SmartDataExtractor.Net.Core ](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Net.Core) package to your project.
+
+```
+dotnet add package Syncfusion.SmartDataExtractor.Net.Core
+```
+
+Step 5: A default controller named HomeController.cs gets added on creation of ASP.NET Core project. Include the following namespaces in that HomeController.cs file.
+
+{% highlight c# tabtitle="C#" %}
+
+using Syncfusion.SmartDataExtractor;
+using System.Diagnostics;
+using System.Text;
+
+{% endhighlight %}
+
+Step 6: A default action method named Index will be present in HomeController.cs. Right-click on Index method and select Go To View where you will be directed to its associated view page Index.cshtml. Add a new button in the Index.cshtml as shown below.
+
+{% highlight c# tabtitle="C#" %}
+
+@{
+ Html.BeginForm("ExtractData", "Home", FormMethod.Get);
+ {
+
+
+
+ }
+ Html.EndForm();
+}
+
+{% endhighlight %}
+
+Step 7: Add a new action method named `ExportToJson` in HomeController.cs and include the following code example to extract data as JSON using the DataExtractor (help.syncfusion.com in Bing) class. Then use the **ExtractDataAsJson** method of the DataExtractor object to process the input and export the results in JSON format.
+
+{% highlight c# tabtitle="C#" %}
+
+// Open the input PDF file as a stream.
+using (FileStream stream = new FileStream(Path.GetFullPath("Input.pdf"), FileMode.Open, FileAccess.Read))
+{
+ // Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ // Extract form data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ // Convert JSON string into a MemoryStream for download.
+ MemoryStream outputStream = new MemoryStream(Encoding.UTF8.GetBytes(data));
+ // Reset stream position.
+ outputStream.Position = 0;
+ // Return JSON file as download in browser.
+ FileStreamResult fileStreamResult = new FileStreamResult(outputStream, "application/json");
+ fileStreamResult.FileDownloadName = "Output.json";
+ return fileStreamResult;
+}
+
+{% endhighlight %}
+
+Step 8: Build the project.
+
+Run the following command in terminal to build the project.
+
+```
+dotnet build
+```
+
+Step 9: Run the project.
+
+Run the following command in terminal to run the project.
+
+```
+dotnet run
+```
+{% endtabcontent %}
+
+{% endtabcontents %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Getting-Started/ASP.NETCore/Extract_Data_as_JSON).
+
+By executing the program, you will get the JSON file as follows.
+
diff --git a/Document-Processing/Data-Extraction/NET/Extract-Data-in-ASP-NET-MVC.md b/Document-Processing/Data-Extraction/NET/Extract-Data-in-ASP-NET-MVC.md
new file mode 100644
index 0000000000..927479ab86
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Extract-Data-in-ASP-NET-MVC.md
@@ -0,0 +1,84 @@
+---
+title: Extract Data in ASP.NET MVC Application | Syncfusion
+description: Learn how to extract data in an ASP.NET MVC application with step‑by‑step guidance using the Syncfusion Data Extraction library.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+keywords: Assemblies
+
+---
+
+# Extract Data in ASP.NET MVC
+
+The Syncfusion® Smart Data Extractor is a .NET library used to extract structured data and document elements from PDFs and images in ASP.NET MVC applications.
+
+## Steps to Extract data from PDF document in ASP.NET MVC
+
+Step 1: Create a new C# ASP.NET Web Application (.NET Framework) project.
+
+
+Step 2: In the project configuration window, name your project and select Create.
+
+
+
+Step 3: Install [Syncfusion.SmartDataExtractor.AspNet.Mvc5](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.AspNet.Mvc5) NuGet package as reference to your .NET application from [NuGet.org](https://www.nuget.org/).
+
+
+Step 4: Include the following namespaces in the HomeController.cs file.
+
+{% highlight c# tabtitle="C#" %}
+
+using System.IO;
+using System.Text;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+Step 5: Add a new button in the Index.cshtml as shown below.
+
+{% highlight c# tabtitle="C#" %}
+
+@{
+ ViewBag.Title = "Home Page";
+}
+
+
+ @using (Html.BeginForm("ExtractData", "Home", FormMethod.Get))
+ {
+
+ }
+
+
+{% endhighlight %}
+
+Step 6: Add a new action method named `ExtractData` in HomeController.cs and include the following code example to extract data from a PDF document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method in the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class.
+
+{% highlight c# tabtitle="C#" %}
+
+// Resolve the path to the input PDF file inside the App_Data folder.
+string inputPath = Server.MapPath("~/App_Data/Input.pdf");
+
+// Open the input PDF file as a stream.
+using (FileStream stream = new FileStream(inputPath, FileMode.Open, FileAccess.ReadWrite))
+{
+ // Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ // Extract form data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ // Convert JSON string into a MemoryStream for download.
+ MemoryStream outputStream = new MemoryStream(Encoding.UTF8.GetBytes(data));
+ // Reset stream position.
+ outputStream.Position = 0;
+ // Return JSON file as download in browser.
+ return File(outputStream, "application/json", "Output.json");
+}
+
+{% endhighlight %}
+
+A complete working sample can be downloaded from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Getting-Started/ASP.NETMVC/Extract_Data).
+
+By executing the program, you will get the JSON file as follows.
+
+
+Click [here](https://www.syncfusion.com/document-sdk/net-pdf-data-extraction) to explore the rich set of Syncfusion® Data Extraction library features.
+
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/NET/Extract-Data-in-Blazor.md b/Document-Processing/Data-Extraction/NET/Extract-Data-in-Blazor.md
new file mode 100644
index 0000000000..902d76fad1
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Extract-Data-in-Blazor.md
@@ -0,0 +1,485 @@
+---
+title: Extract Data in Blazor Application | Syncfusion
+description: Learn to extract tables, forms, text, and images from PDF documents and scanned images in Blazor using the Syncfusion® Smart Data Extractor .NET library.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+keywords: Assemblies
+
+---
+
+# Extract Data from PDF in Blazor
+
+The Syncfusion® Smart Data Extractor is a .NET library used to extract structured data and document elements from PDFs and images in Blazor applications.
+
+## Steps to Extract Data from PDF in Blazor application
+
+{% tabcontents %}
+
+{% tabcontent Visual Studio %}
+
+**Prerequisites**:
+
+* Install .NET SDK: Ensure that you have the .NET SDK installed on your system. You can download it from the [.NET Downloads page](https://dotnet.microsoft.com/en-us/download).
+* Install Visual Studio: Download and install Visual Studio from the [official website](https://code.visualstudio.com/download).
+
+
+Step 1: Create a new C# Blazor Web App project.
+* Select "Blazor Web App" from the template and click **Next**.
+
+
+
+* Name the project and click **Next**.
+
+
+
+* Select the framework and click **Create** button.
+
+
+
+Step 2: Install the `Syncfusion.SmartDataExtractor.Net.Core` NuGet package.
+
+To **Extract Data from PDF in a Blazor Web App Server**, install [Syncfusion.SmartDataExtractor.Net.Core](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Net.Core) into the Blazor project.
+
+
+
+Step 3: Create a Razor file named `Home.razor` in the `Pages` folder, which is located inside the `Components` folder.
+
+Add the following directives and service injections in the file
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+@page "/"
+@rendermode InteractiveServer
+@using Extract_Data_Blazor.Services
+@inject ExtractionService extractor
+@inject IJSRuntime JS
+
+{% endhighlight %}
+{% endtabs %}
+
+Step 4: Add a button to `Home.razor`.
+
+Include the following code snippet to add a button in your Blazor application that triggers the “Extract Data as JSON” conversion:
+
+{% tabs %}
+{% highlight CSHTML %}
+Run Extraction
+
+ Run Extractor
+
+@message
+
+{% endhighlight %}
+{% endtabs %}
+
+Step 5: Implement the method in `Home.razor`.
+
+Add the following code snippet to extract data from a PDF and download the file in your Blazor application.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+@code {
+ string message = "Waiting...";
+ async Task RunExtraction()
+ {
+ message = "Processing...";
+ StateHasChanged(); // force UI update immediately
+ message = await extractor.RunExtraction();
+ }
+}
+{% endhighlight %}
+{% endtabs %}
+
+Step 6: Create a new cs file `ExtractionService.cs` in the `Services` folder.
+
+Include the following namespaces in the file:
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+using Syncfusion.Pdf.Parsing;
+using Syncfusion.SmartDataExtractor;
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 7: Implement the method in `ExtractionService.cs`.
+
+Create a new method in the ExtractionService class, and add the following code snippet to extract data as JSON from a PDF in a Blazor Web App Server.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+using (FileStream stream = new FileStream(@"wwwroot/Input.pdf", FileMode.Open, FileAccess.Read, FileShare.ReadWrite))
+{
+ // Initialize the Smart Data Extractor
+ DataExtractor extractor = new DataExtractor();
+ // Extract data as JSON string
+ string data = extractor.ExtractDataAsJson(stream);
+ // Return the JSON string
+ return data;
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 8: Add the service in Program.cs.
+
+Include the following namespaces in the Program.cs file:
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+using Extract_Data_Blazor.Components;
+using Extract_Data_Blazor.Services;
+
+{% endhighlight %}
+{% endtabs %}
+
+Add the following line to the `Program.cs` file to register `ExtractionService` as a scoped service in the Blazor application.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+builder.Services.AddScoped();
+
+{% endhighlight %}
+{% endtabs %}
+
+Step 9: Create `FileUtils.cs` for JavaScript interoperability.
+
+Create a new class file named `FileUtils` in the project and add the following code to invoke the JavaScript action for file download in the browser.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+public static class FileUtils
+{
+ public static ValueTask SaveAs(this IJSRuntime js, string filename, byte[] data)
+ => js.InvokeAsync(
+ "saveAsFile",
+ filename,
+ Convert.ToBase64String(data));
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 10: Add JavaScript function to `App.razor`.
+
+Add the following JavaScript function in the `App.razor` file located in the root of the project.
+
+{% tabs %}
+
+{% highlight HTML %}
+
+
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 11: Add navigation link.
+
+Add the following code snippet to the Navigation menu's Razor file in the `Layout` folder.
+
+{% tabs %}
+
+{% highlight HTML %}
+
+
+
+ Data Extraction
+
+
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 12: Build the project.
+
+Click on **Build** → **Build Solution** or press Ctrl +Shift +B to build the project.
+
+Step 13: Run the project.
+
+Click the Start button (green arrow) or press F5 to run the application.
+
+Upon executing the program, the JSON file will be generated as follows.
+
+
+
+{% endtabcontent %}
+
+{% tabcontent Visual Studio Code %}
+
+**Prerequisites:**
+
+* Visual Studio Code.
+* Install [.NET 8 SDK](https://dotnet.microsoft.com/en-us/download/dotnet/8.0) or later.
+* Open Visual Studio Code and install the [C# for Visual Studio Code extension](https://marketplace.visualstudio.com/items?itemName=ms-dotnettools.csharp) from the Extensions Marketplace.
+
+
+Step 1: Create a new C# Blazor Web App project.
+* Open the command palette by pressing Ctrl +Shift +P and type **.NET:New Project** and enter.
+* Choose the **Blazor Web App** template.
+
+
+
+* Select the project location, type the project name and press enter.
+* Then choose **Create project**.
+
+Step 2: To **Extract Data from PDF in Web app**, install [Syncfusion.SmartDataExtractor.Net.Core](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Net.Core) to the Blazor project.
+* Press Ctrl + ` (backtick) to open the integrated terminal in Visual Studio Code.
+* Ensure you're in the project root directory where your .csproj file is located.
+* Run the command `dotnet add package Syncfusion.SmartDataExtractor.Net.Core` to install the NuGet package.
+
+```
+dotnet add package Syncfusion.SmartDataExtractor.NET.Core
+
+```
+
+Step 3: Create a Razor file named `Home.razor` in the `Pages` folder, which is located inside the `Components` folder.
+
+Add the following directives and service injections in the file
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+@page "/"
+@rendermode InteractiveServer
+@using Extract_Data_Blazor.Services
+@inject ExtractionService extractor
+@inject IJSRuntime JS
+
+{% endhighlight %}
+{% endtabs %}
+
+Step 4: Add a button to `Home.razor`.
+
+Include the following code snippet to add a button in your Blazor application that triggers the “Extract Data as JSON” conversion:
+
+{% tabs %}
+{% highlight CSHTML %}
+Run Extraction
+
+
+ Run Extractor
+
+
+@message
+
+
+{% endhighlight %}
+{% endtabs %}
+
+Step 5: Implement the method in `Home.razor`.
+
+Add the following code snippet to extract data from a PDF and download the file in your Blazor application.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+@code {
+ string message = "Waiting...";
+ async Task RunExtraction()
+ {
+ message = "Processing... ";
+ StateHasChanged(); // force UI update immediately
+ // Run extractor to get JSON string
+ var json = await extractor.RunExtraction();
+ // Convert JSON to UTF8 bytes and trigger browser download via JS interop
+ var bytes = System.Text.Encoding.UTF8.GetBytes(json ?? string.Empty);
+ await JS.SaveAs("extracted.json", bytes);
+ message = "Download started";
+ }
+}
+{% endhighlight %}
+{% endtabs %}
+
+Step 6: Create a new cs file `ExtractionService.cs` in the `Services` folder.
+
+Include the following namespaces in the file:
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+using Syncfusion.Pdf.Parsing;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 7: Implement the method in `ExtractionService.cs`.
+
+Create a new method in the ExtractionService class, and add the following code snippet to extract data as JSON from a PDF in a Blazor Web App Server.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+using (FileStream stream = new FileStream(@"wwwroot/Input.pdf", FileMode.Open, FileAccess.Read, FileShare.ReadWrite))
+{
+ // Initialize the Smart Data Extractor
+ DataExtractor extractor = new DataExtractor();
+
+ // Extract data as JSON string
+ string data = extractor.ExtractDataAsJson(stream);
+
+ // Return the JSON string
+ return data;
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 8: Add the service in Program.cs.
+
+Include the following namespaces in the Program.cs file:
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+using Extract_Data_Blazor.Components;
+using Extract_Data_Blazor.Services;
+
+{% endhighlight %}
+{% endtabs %}
+
+Add the following line to the `Program.cs` file to register `ExtractionService` as a scoped service in the Blazor application.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+builder.Services.AddScoped();
+
+{% endhighlight %}
+{% endtabs %}
+
+Step 9: Create `FileUtils.cs` for JavaScript interoperability.
+
+Create a new class file named `FileUtils` in the project and add the following code to invoke the JavaScript action for file download in the browser.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+public static class FileUtils
+{
+ public static ValueTask SaveAs(this IJSRuntime js, string filename, byte[] data)
+ => js.InvokeAsync(
+ "saveAsFile",
+ filename,
+ Convert.ToBase64String(data));
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 10: Add JavaScript function to `App.razor`.
+
+Add the following JavaScript function in the `App.razor` file located in the root of the project.
+
+{% tabs %}
+
+{% highlight HTML %}
+
+
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 11: Add navigation link.
+
+Add the following code snippet to the Navigation menu's Razor file in the `Layout` folder.
+
+{% tabs %}
+
+{% highlight HTML %}
+
+
+
+ Data Extraction
+
+
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 12: Build the project.
+
+Run the following command in terminal to build the project.
+
+```
+dotnet build
+```
+
+Step 13: Run the project.
+
+Run the following command in terminal to run the project.
+
+```
+dotnet run
+```
+
+Upon executing the program, the JSON file will be generated as follows.
+
+
+
+{% endtabcontent %}
+
+{% endtabcontents %}
+
diff --git a/Document-Processing/Data-Extraction/NET/Extract-Data-in-Console.md b/Document-Processing/Data-Extraction/NET/Extract-Data-in-Console.md
new file mode 100644
index 0000000000..9876cacf4c
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Extract-Data-in-Console.md
@@ -0,0 +1,212 @@
+---
+title: Extract Data in Console Application | Syncfusion
+description: Learn how to extract data in a Console Application by using the Syncfusion Smart Data Extractor efficiently.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+---
+
+# Extract Data from PDF in Console Application
+
+The Syncfusion® Smart Data Extractor is a .NET library used to extract structured data and document elements from PDFs and images in Console applications.
+
+## Steps to Extract Data from PDF in Console App
+
+{% tabcontents %}
+{% tabcontent Visual Studio %}
+
+**Prerequisites**:
+
+* Install .NET SDK: Ensure that you have the .NET SDK installed on your system. You can download it from the [.NET Downloads page](https://dotnet.microsoft.com/en-us/download).
+* Install Visual Studio: Download and install Visual Studio from the [official website](https://code.visualstudio.com/download).
+
+Step 1: Create a new C# Console Application project.
+
+
+Step 2: Name the project.
+
+
+Step 3: Install the [Syncfusion.SmartDataExtractor.Net.Core](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Net.Core) NuGet package as reference to your .NET Standard applications from [NuGet.org](https://www.nuget.org).
+
+
+Step 4: Include the following namespaces in the *Program.cs* file.
+
+{% highlight c# tabtitle="C#" %}
+
+using System.IO;
+using Syncfusion.Pdf.Parsing;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+Step 5: Include the following code snippet in *Program.cs* to Extract data from an PDF file.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" %}
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 6: Build the project.
+
+Click on Build > Build Solution or press Ctrl + Shift + B to build the project.
+
+Step 7: Run the project.
+
+Click the Start button (green arrow) or press F5 to run the app.
+
+{% endtabcontent %}
+
+{% tabcontent Visual Studio Code %}
+**Prerequisites**:
+
+* Install .NET SDK: Ensure that you have the .NET SDK installed on your system. You can download it from the [.NET Downloads page](https://dotnet.microsoft.com/en-us/download).
+* Install Visual Studio Code: Download and install Visual Studio Code from the [official website](https://code.visualstudio.com/download).
+* Install C# Extension for VS Code: Open Visual Studio Code, go to the Extensions view (Ctrl+Shift+X), and search for 'C#'. Install the official [C# extension provided by Microsoft](https://marketplace.visualstudio.com/items?itemName=ms-dotnettools.csharp).
+
+
+
+Step 1: Open the terminal (Ctrl+` ) and run the following command to create a new .NET Core console application project.
+
+```
+dotnet new console -n ExtractDataConsoleApp
+```
+Step 2: Replace ****ExtractDataConsoleApp** with your desired project name.
+
+Step 3: Navigate to the project directory using the following command
+
+```
+cd ExtractDataConsoleApp
+```
+Step 4: Use the following command in the terminal to add the [Syncfusion.SmartDataExtractor.Net.Core](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Net.Core) package to your project.
+
+```
+dotnet add package Syncfusion.SmartDataExtractor.Net.Core
+```
+
+Step 5: Include the following namespaces in the *Program.cs* file.
+
+{% highlight c# tabtitle="C#" %}
+
+using System.IO;
+using Syncfusion.Pdf.Parsing;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+Step 6: Include the following code snippet in *Program.cs* to Extract data from an PDF file.
+
+{% highlight c# tabtitle="C# [Cross-platform]" %}
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+Step 7: Build the project.
+
+Run the following command in terminal to build the project.
+
+```
+dotnet build
+```
+
+Step 8: Run the project.
+
+Run the following command in terminal to run the project.
+
+```
+dotnet run
+```
+
+{% endtabcontent %}
+
+{% endtabcontents %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Getting-Started/Console/.NET/Extract_Data_as_JSON).
+
+By executing the program, you will get the JSON file as follows.
+
+
+## Extract Data from PDF using .NET Framework
+
+The following steps illustrates Extracting Data from PDF document in console application using .NET Framework.
+
+**Prerequisites**:
+
+* Install .NET SDK: Ensure that you have the .NET SDK installed on your system. You can download it from the [.NET Downloads page](https://dotnet.microsoft.com/en-us/download).
+* Install Visual Studio: Download and install Visual Studio from the [official website](https://code.visualstudio.com/download).
+
+**Steps to Extract Data from PDF using .NET Framework**
+
+Step 1: Create a new C# Console Application (.NET Framework) project.
+
+
+Step 2: Name the project.
+
+
+Step 3: Install the [Syncfusion.SmartDataExtractor.WinForms](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.WinForms/) NuGet package as reference to your .NET Standard applications from [NuGet.org](https://www.nuget.org).
+
+
+Step 4: Include the following namespaces in the *Program.cs*.
+
+{% highlight c# tabtitle="C#" %}
+
+using System.IO;
+using Syncfusion.Pdf.Parsing;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+Step 5: Include the following code sample in *Program.cs* to Extract data from an PDF file.
+
+{% highlight c# tabtitle="C#" %}
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+Step 6: Build the project.
+
+Click on Build > Build Solution or press Ctrl + Shift + B to build the project.
+
+Step 7: Run the project.
+
+Click the Start button (green arrow) or press F5 to run the app.
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Getting-Started/Console/.NETFramework/Extract_Data).
+
+By executing the program, you will get the JSON file as follows.
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/Extract-Data-in-MAUI.md b/Document-Processing/Data-Extraction/NET/Extract-Data-in-MAUI.md
new file mode 100644
index 0000000000..902f8582d0
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Extract-Data-in-MAUI.md
@@ -0,0 +1,233 @@
+---
+title: Extract Data in .NET MAUI | Syncfusion
+description: Extract tables, forms, text, and images from PDF documents and scanned files in .NET MAUI using the Syncfusion® Smart Data Extractor.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+keywords: Assemblies
+
+---
+
+# Extract Data from PDF in .NET MAUI
+
+The Syncfusion® Smart Data Extractor is a .NET library used to extract structured data and document elements from PDFs and images in .NET MAUI applications.
+
+## Steps to Extract Data from PDF in .NET MAUI
+
+{% tabcontents %}
+
+{% tabcontent Visual Studio %}
+
+**Prerequisites:**
+
+* Visual Studio 2022.
+* Install [.NET 8 SDK](https://dotnet.microsoft.com/en-us/download/dotnet/8.0) or later.
+* For more details about installation, refer [here](https://learn.microsoft.com/en-us/dotnet/maui/get-started/installation?view=net-maui-7.0&tabs=vswin).
+
+Step 1: Create a new C# .NET MAUI app. Select **.NET MAUI App** from the template and click the **Next** button.
+
+
+
+Step 2: Enter the project name and click **Create**.
+
+
+
+Step 3: Install the [Syncfusion.SmartDataExtractor.NET](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.NET) NuGet package as a reference to your project from [NuGet.org](https://www.nuget.org/).
+
+
+
+N> If you reference Syncfusion® assemblies from trial setup or from the NuGet feed, you also have to add "Syncfusion.Licensing" assembly reference and include a license key in your projects. Please refer to this [link](https://help.syncfusion.com/common/essential-studio/licensing/overview) to know about registering a Syncfusion® license key in your application to use our components.
+
+Step 4: Add a new button to the **MainPage.xaml** as shown below.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 5: Include the following namespaces in the **MainPage.xaml.cs** file.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+using System.Text;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 6: Add a new action method **OnExtractDataClicked** in MainPage.xaml.cs and include the following code snippet to **Extract Data from PDF**.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+// Load the input PDF from the app package (include it in the project as a MauiAsset)
+using Stream inputStream = await FileSystem.OpenAppPackageFileAsync(Path.Combine("Data", "Input.pdf"));
+// Initialize Smart Data Extractor
+DataExtractor extractor = new DataExtractor();
+// Extract data as JSON string
+string data = extractor.ExtractDataAsJson(inputStream);
+// Save the extracted JSON data into an output file inside the application directory
+string outputPath = Path.Combine(Environment.CurrentDirectory, "Output", "Output.json");
+Directory.CreateDirectory(Path.GetDirectoryName(outputPath));
+File.WriteAllText(outputPath, data, Encoding.UTF8);
+// Show success message
+await DisplayAlert("Success", $"Extracted data saved to {outputPath}", "OK");
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 7: Run the Application.
+
+1. Select the target framework, device or emulator.
+2. Press F5 to run the application.
+
+By executing the program, you will get the JSON file as follows.
+
+
+
+Click [here](https://www.syncfusion.com/document-sdk/net-pdf-data-extraction) to explore the rich set of Syncfusion® Data Extraction library features.
+
+
+{% endtabcontent %}
+
+{% tabcontent Visual Studio Code %}
+
+**Prerequisites:**
+
+* Install the latest .NET SDK and Visual Studio Code.
+* Open Visual Studio Code and install the [.NET MAUI for Visual Studio Code extension](https://marketplace.visualstudio.com/items?itemName=ms-dotnettools.dotnet-maui) from the Extensions Marketplace.
+* Follow the step-by-step setup guide:
+ - [Set up .NET MAUI with Visual Studio Code](https://learn.microsoft.com/en-us/dotnet/maui/get-started/installation?view=net-maui-9.0&tabs=visual-studio-code)
+ - [Steps for each platform](https://learn.microsoft.com/en-us/dotnet/maui/get-started/first-app?pivots=devices-windows&view=net-maui-9.0&tabs=visual-studio-code)
+
+Step 1: Create a new C# .NET MAUI app project.
+* Open the command palette by pressing Ctrl +Shift +P and type **.NET:New Project** and enter.
+* Choose the **.NET MAUI App** template.
+
+
+
+* Select the project location, type the project name and press enter.
+* Then choose **Create project**.
+
+Step 2: To **Extract Data from PDF Document in .NET MAUI app**, install [Syncfusion.SmartDataExtractor.NET](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.NET) to the MAUI project.
+* Press Ctrl + ` (backtick) to open the integrated terminal in Visual Studio Code.
+* Ensure you're in the project root directory where your .csproj file is located.
+* Run the command `dotnet add package Syncfusion.SmartDataExtractor.NET` to install the NuGet package.
+
+```
+dotnet add package Syncfusion.SmartDataExtractor.NET
+```
+
+Step 3: Add a new button to the **MainPage.xaml** as shown below.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+
+
+
+
+
+
+
+
+
+
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 4: Include the following namespaces in the **MainPage.xaml.cs** file.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+using System.Text;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 5: Add a new action method **OnExtractDataClicked** in MainPage.xaml.cs and include the following code snippet to **Extract Data from PDF**.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+// Load the input PDF from the app package (include it in the project as a MauiAsset)
+using Stream inputStream = await FileSystem.OpenAppPackageFileAsync(Path.Combine("Data", "Input.pdf"));
+// Initialize Smart Data Extractor
+DataExtractor extractor = new DataExtractor();
+// Extract data as JSON string
+string data = extractor.ExtractDataAsJson(inputStream);
+// Save the extracted JSON data into an output file inside the application directory
+string outputPath = Path.Combine(Environment.CurrentDirectory, "Output", "Output.json");
+Directory.CreateDirectory(Path.GetDirectoryName(outputPath));
+File.WriteAllText(outputPath, data, Encoding.UTF8);
+// Show success message
+await DisplayAlert("Success", $"Extracted data saved to {outputPath}", "OK");
+
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 6: Run the Application.
+
+1. Select the target framework, device or emulator.
+2. Press F5 to run the application.
+
+By executing the program, you will get the JSON file as follows.
+
+
+
+{% endtabcontent %}
+
+{% endtabcontents %}
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/NET/Extract-Data-in-WPF.md b/Document-Processing/Data-Extraction/NET/Extract-Data-in-WPF.md
new file mode 100644
index 0000000000..1cf5f85df7
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Extract-Data-in-WPF.md
@@ -0,0 +1,75 @@
+---
+title: Extract Data in WPF Application | Syncfusion
+description: Learn how to extract data in a WPF application with step‑by‑step guidance using the Syncfusion Smart Data Extractor library.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+keywords: Assemblies
+
+---
+
+# Extract Data from PDF in WPF
+
+The Syncfusion® Smart Data Extractor is a .NET library used to extract structured data and document elements from PDFs and images in WPF applications.
+
+## Steps to Extract Data from PDF document in WPF
+
+Step 1: Create a new WPF application project.
+
+
+In the project configuration window, name your project and select Create.
+
+
+Step 2: Install the [Syncfusion.SmartDataExtractor.WPF](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.WPF) NuGet package as a reference to your WPF application [NuGet.org](https://www.nuget.org/).
+
+
+
+Step 3: Include the following namespaces in the MainWindow.xaml.cs file.
+
+{% highlight c# tabtitle="C#" %}
+
+using System;
+using System.IO;
+using System.Text;
+using System.Windows;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+Step 4: Add a new button in MainWindow.xaml to extract data from a PDF document as follows.
+
+{% highlight c# tabtitle="C#" %}
+
+
+
+
+
+{% endhighlight %}
+
+Step 5: Add the following code in `ExtractButton_Click` to extract data from a PDF document using the **ExtractDataAsJson** method in the **DataExtractor** class. The extracted content will be saved as a JSON file
+
+{% highlight c# tabtitle="C#" %}
+
+// Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ // Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ // Extract form data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ // Save the extracted JSON data into an output file (inline path).
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+A complete working sample can be downloaded from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Getting-Started/WPF/Extract_Data).
+
+By executing the program, you will get the JSON file as follows.
+
+
+ Click [here](https://www.syncfusion.com/document-sdk/net-pdf-data-extraction) to explore the rich set of Syncfusion® Data Extraction library features.
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/NET/Extract-Data-in-Windows-Forms.md b/Document-Processing/Data-Extraction/NET/Extract-Data-in-Windows-Forms.md
new file mode 100644
index 0000000000..613b590592
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Extract-Data-in-Windows-Forms.md
@@ -0,0 +1,124 @@
+---
+title: Extract Data from PDF in Windows Forms | Syncfusion
+description: Extract tables, text, and form fields from PDF documents in Windows Forms using the Syncfusion Smart Data Extractor library.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+
+---
+
+# Extract Data in Windows Forms
+
+The Syncfusion® Smart Data Extractor is a .NET library used to extract structured data and document elements from PDFs and images in Windows Forms applications.
+
+## Steps to Extract Data in Windows Forms
+
+{% tabcontents %}
+
+{% tabcontent Visual Studio %}
+
+**Prerequisites:**
+
+* Visual Studio 2022.
+* Install **.NET desktop development** workload with necessary .NET Framework SDK.
+
+Step 1: Create a new Windows Forms application project.
+
+
+
+Step 2: Name the project.
+
+
+Step 3: Install [Syncfusion.SmartDataExtractor.WinForms](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.WinForms) NuGet package as a reference to your Windows Forms application from the [NuGet.org](https://www.nuget.org/).
+
+
+
+Step 4: Include the following namespaces in the **Form1.cs** file.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+using System;
+using System.IO;
+using System.Text;
+using System.Windows.Forms;
+using Syncfusion.SmartDataExtractor;
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 5: Add a new button in **Form1.Designer.cs** to extract data from PDF as follows.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+private Button btnExtract;
+private Label label;
+
+private void InitializeComponent()
+{
+label = new Label();
+btnExtract = new Button();
+
+// Label
+label.Location = new System.Drawing.Point(0, 40);
+label.Size = new System.Drawing.Size(426, 35);
+label.Text = "Click the button to extract data from PDF using Smart Data Extractor.";
+label.TextAlign = System.Drawing.ContentAlignment.MiddleCenter;
+
+// Button
+btnExtract.Location = new System.Drawing.Point(160, 110);
+btnExtract.Size = new System.Drawing.Size(120, 36);
+btnExtract.Text = "Extract Data from PDF";
+btnExtract.Click += new EventHandler(btnExtract_Click);
+
+// Form
+ClientSize = new System.Drawing.Size(450, 150);
+Controls.Add(label);
+Controls.Add(btnExtract);
+Text = "Extract Data from PDF";
+}
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 6: Add the following code in **btnExtract_Click** to extract data from PDF.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C#" %}
+
+// Load the existing PDF document
+using (FileStream stream = new FileStream(Path.GetFullPath(@"../../Data/Input.pdf"), FileMode.Open, FileAccess.Read))
+{
+ // Initialize the Smart Data Extractor
+ DataExtractor extractor = new DataExtractor();
+ // Extract data as JSON string
+ string data = extractor.ExtractDataAsJson(stream);
+ // Save the extracted JSON data into an output file
+ File.WriteAllText(Path.GetFullPath(@"../../Output.json"), data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+Step 7: Build the project.
+
+Click on Build → Build Solution or press Ctrl +Shift +B to build the project.
+
+Step 8: Run the project.
+
+Click the Start button (green arrow) or press F5 to run the app.
+
+By executing the program, you will get the JSON file as follows.
+
+
+
+{% endtabcontent %}
+
+{% endtabcontents %}
+
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreAdditionalInfo.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreAdditionalInfo.png
new file mode 100644
index 0000000000..37df40189c
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreAdditionalInfo.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreConfiguration.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreConfiguration.png
new file mode 100644
index 0000000000..e085f0a6ef
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreConfiguration.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreProjectCreation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreProjectCreation.png
new file mode 100644
index 0000000000..2b82f8b860
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCoreProjectCreation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCore_Nuget.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCore_Nuget.png
new file mode 100644
index 0000000000..afed631297
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ASPNETCore_Nuget.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor-Web-app-template.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor-Web-app-template.png
new file mode 100644
index 0000000000..23d65753b3
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor-Web-app-template.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Additional_Info.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Additional_Info.png
new file mode 100644
index 0000000000..7026b4209a
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Additional_Info.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Nuget.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Nuget.png
new file mode 100644
index 0000000000..417f400f3c
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Nuget.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_ProjectCreation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_ProjectCreation.png
new file mode 100644
index 0000000000..1397a4a8a1
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_ProjectCreation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Project_Name.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Project_Name.png
new file mode 100644
index 0000000000..8e435bc3ce
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Blazor_Project_Name.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleCoreNuget.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleCoreNuget.png
new file mode 100644
index 0000000000..afed631297
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleCoreNuget.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleCreation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleCreation.png
new file mode 100644
index 0000000000..3e3e836e3f
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleCreation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleFramework_ProjectCreation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleFramework_ProjectCreation.png
new file mode 100644
index 0000000000..1f7fb124d1
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleFramework_ProjectCreation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleFramework_ProjectName.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleFramework_ProjectName.png
new file mode 100644
index 0000000000..b1c7c64ee7
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleFramework_ProjectName.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleName.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleName.png
new file mode 100644
index 0000000000..0dde6e93fa
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleName.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleNuget.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleNuget.png
new file mode 100644
index 0000000000..3348e61ca7
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleNuget.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleNuget_DataExtractor.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleNuget_DataExtractor.png
new file mode 100644
index 0000000000..8fa0bc3a44
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/ConsoleNuget_DataExtractor.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/CreateASPNetMVCProject_DataExtractor.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/CreateASPNetMVCProject_DataExtractor.png
new file mode 100644
index 0000000000..885c477d62
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/CreateASPNetMVCProject_DataExtractor.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Framework-name.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Framework-name.png
new file mode 100644
index 0000000000..b1c7c64ee7
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Framework-name.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/JSON_Output.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/JSON_Output.png
new file mode 100644
index 0000000000..6bf7a1f23a
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/JSON_Output.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI-app-template.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI-app-template.png
new file mode 100644
index 0000000000..fc4a9c6a8f
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI-app-template.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Additional_Info.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Additional_Info.png
new file mode 100644
index 0000000000..e7f0ba5d93
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Additional_Info.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Nuget.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Nuget.png
new file mode 100644
index 0000000000..b4e02f897f
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Nuget.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Project_Creation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Project_Creation.png
new file mode 100644
index 0000000000..d0387cb2a0
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Project_Creation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Project_Name.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Project_Name.png
new file mode 100644
index 0000000000..ced34803fc
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MAUI_Project_Name.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_DataExtractorNuget.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_DataExtractorNuget.png
new file mode 100644
index 0000000000..b3fc595e7e
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_DataExtractorNuget.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_Data_ProjectCreation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_Data_ProjectCreation.png
new file mode 100644
index 0000000000..c17d66a684
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_Data_ProjectCreation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_ProjectCreation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_ProjectCreation.png
new file mode 100644
index 0000000000..70ecdd5382
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/MVC_ProjectCreation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_DataExtractor_Nuget.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_DataExtractor_Nuget.png
new file mode 100644
index 0000000000..9498903236
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_DataExtractor_Nuget.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_Data_ProjectName.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_Data_ProjectName.png
new file mode 100644
index 0000000000..f74a5f0b4e
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_Data_ProjectName.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_ProjectCreation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_ProjectCreation.png
new file mode 100644
index 0000000000..f839cb86ae
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/WPF_ProjectCreation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_Nuget.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_Nuget.png
new file mode 100644
index 0000000000..8fa0bc3a44
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_Nuget.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_ProjectCreation.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_ProjectCreation.png
new file mode 100644
index 0000000000..4047fe2a95
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_ProjectCreation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_ProjectName.png b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_ProjectName.png
new file mode 100644
index 0000000000..5211132056
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/GettingStarted_images/Winforms_ProjectName.png differ
diff --git a/Document-Processing/Data-Extraction/NET/Nuget-packages-required.md b/Document-Processing/Data-Extraction/NET/Nuget-packages-required.md
new file mode 100644
index 0000000000..4f72da15b0
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/Nuget-packages-required.md
@@ -0,0 +1,172 @@
+---
+title: NuGet Packages for Data Extraction | Syncfusion®
+description: Learn the NuGet packages required to use Syncfusion® Data Extraction in various platforms and frameworks.
+platform: document-processing
+control: DataExtraction
+documentation: UG
+keywords: Assemblies
+---
+
+# NuGet Packages Required for Data Extraction
+
+## Smart Data Extractor
+
+To work with Smart Data Extractor, the following NuGet packages need to be installed in your application from [nuget.org](https://www.nuget.org/).
+
+
+
+
+Platform(s)
+NuGet Package
+
+
+
+
+Windows Forms
+Console Application (Targeting .NET Framework)
+
+
+{{'[Syncfusion.SmartDataExtractor.WinForms.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.WinForms/)'| markdownify }}
+
+
+
+
+WPF
+
+
+{{'[Syncfusion.SmartDataExtractor.Wpf.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Wpf)'| markdownify }}
+
+
+
+
+ASP.NET MVC5
+
+
+{{'[Syncfusion.SmartDataExtractor.AspNet.Mvc5.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.AspNet.Mvc5)'| markdownify }}
+
+
+
+
+ASP.NET Core (Targeting NET Core)
+Console Application (Targeting .NET Core)
+
+
+{{'[Syncfusion.SmartDataExtractor.Net.Core.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Net.Core)'| markdownify }}
+
+
+
+
+Windows UI (WinUI)
+.NET Multi-platform App UI (.NET MAUI)
+
+
+{{'[Syncfusion.SmartDataExtractor.NET.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.NET)'| markdownify }}
+
+
+
+
+## Smart Table Extractor
+
+To work with Smart Table Extractor, the following NuGet packages need to be installed in your application from [nuget.org](https://www.nuget.org/).
+
+
+
+
+Platform(s)
+NuGet Package
+
+
+
+
+Windows Forms
+Console Application (Targeting .NET Framework)
+
+
+{{'[Syncfusion.SmartTableExtractor.WinForms.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.WinForms/)'| markdownify }}
+
+
+
+
+WPF
+
+
+{{'[Syncfusion.SmartTableExtractor.Wpf.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.Wpf)'| markdownify }}
+
+
+
+
+ASP.NET MVC5
+
+
+{{'[Syncfusion.SmartTableExtractor.AspNet.Mvc5.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.AspNet.Mvc5)'| markdownify }}
+
+
+
+
+ASP.NET Core (Targeting NET Core)
+Console Application (Targeting .NET Core)
+
+
+{{'[Syncfusion.SmartTableExtractor.Net.Core.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.Net.Core)'| markdownify }}
+
+
+
+
+Windows UI (WinUI)
+.NET Multi-platform App UI (.NET MAUI)
+
+
+{{'[Syncfusion.SmartTableExtractor.NET.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.NET)'| markdownify }}
+
+
+
+
+
+## Smart Form Recognizer
+
+To work with Smart Form Recognizer, the following NuGet packages need to be installed in your application from [nuget.org](https://www.nuget.org/).
+
+
+
+
+Platform(s)
+NuGet Package
+
+
+
+
+Windows Forms
+
+
+{{'[Syncfusion.SmartFormRecognizer.WinForms.nupkg](https://www.nuget.org/packages/Syncfusion.SmartFormRecognizer.WinForms)'| markdownify }}
+
+
+
+
+WPF
+
+
+{{'[Syncfusion.SmartFormRecognizer.Wpf.nupkg](https://www.nuget.org/packages/Syncfusion.SmartFormRecognizer.Wpf)'| markdownify }}
+
+
+
+Blazor, .NET Core and .NET Platforms
+
+{{'[Syncfusion.SmartFormRecognizer.NET.nupkg](https://www.nuget.org/packages/Syncfusion.SmartFormRecognizer.NET)'| markdownify }}
+
+
+
+
+{{'[Syncfusion.SmartFormRecognizer.Net.Core.nupkg](https://www.nuget.org/packages/Syncfusion.SmartFormRecognizer.Net.Core)'| markdownify }}
+
+
+
+
+ASP.NET MVC5
+
+
+{{'[Syncfusion.SmartFormRecognizer.AspNet.MVC5.nupkg](https://www.nuget.org/packages/Syncfusion.SmartFormRecognizer.AspNet.MVC5)'| markdownify }}
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/conversions/overview.md b/Document-Processing/Data-Extraction/NET/conversions/overview.md
new file mode 100644
index 0000000000..02cbf0e46d
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/conversions/overview.md
@@ -0,0 +1,13 @@
+---
+title: Smart Data Extractor Library | Syncfusion
+description: Smart Data Extractor converts PDF documents and images to structured formats like JSON, Markdown (MD), and PDF output.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+keywords: Assemblies
+---
+
+# Welcome to Syncfusion® Smart Data Extractor Library
+
+Syncfusion® Smart Data Extractor Library extracts structured information from PDF documents and scanned images. It supports conversions such as **PDF to JSON**, **PDF to Markdown (MD)**, and generating **PDF output** by analyzing visual layout patterns like text blocks, tables, headers, and form fields. This helps developers easily integrate the extractor to achieve required data conversions while focusing on the core logic of their applications.
+
diff --git a/Document-Processing/Data-Extraction/NET/conversions/pdf-to-json.md b/Document-Processing/Data-Extraction/NET/conversions/pdf-to-json.md
new file mode 100644
index 0000000000..6129b51642
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/conversions/pdf-to-json.md
@@ -0,0 +1,127 @@
+---
+title: Extract PDF to JSON in C# | Smart Data Extractor | Syncfusion
+description: Learn how to extract structured data from PDF documents as JSON in C# using the Syncfusion® Smart Data Extractor library for .NET applications.
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+keywords: Assemblies
+---
+
+# PDF to JSON Extraction
+
+JavaScript Object Notation (JSON) is a lightweight data‑interchange format that is easy for humans to read and write, and simple for machines to parse and generate. The Syncfusion® Smart Data Extractor library extracts structured information from PDF documents and scanned images, and outputs the content as JSON. It analyzes text blocks, tables, headers, and form fields to preserve structure, enabling developers to integrate PDF to JSON extraction into their applications.
+
+## Assemblies and NuGet packages required
+
+Refer to the following links for the assemblies and NuGet packages required on different platforms to extract data as a JSON file using the Smart Data Extractor library.
+
+* [Assemblies required for PDF to JSON Extraction](/document-processing/data-extraction/net/Assemblies-required)
+* [NuGet packages required for PDF to JSON Extraction](/document-processing/data-extraction/net/Nuget-packages-required)
+
+## Extract Data as JSON from PDF or Image
+
+To extract form fields across a PDF document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
+
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using Syncfusion.SmartFormRecognizer;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% highlight c# tabtitle="C# [Windows-specific]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using Syncfusion.SmartFormRecognizer;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+N> If you want to extract data from an image instead of a PDF, replace the input stream with the image file (for example, Input.jpg or Input.png). The rest of the code remains unchanged.
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Extract-data-as-JSON-from-PDF/.NET).
+
+
+## Extract Data from a Customized Page Range
+
+To extract data from a specific range of pages in a PDF document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
+
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Set the page range for extraction (pages 1 to 3).
+ extractor.PageRange = new int[,] { { 1, 3 } };
+ //Extract data as JSON string.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+
+{% endhighlight %}
+
+{% highlight c# tabtitle="C# [Windows-specific]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Set the page range for extraction (pages 1 to 3).
+ extractor.PageRange = new int[,] { { 1, 3 } };
+ //Extract data as JSON string.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+## JSON Output Structure and Attributes
+
+The JSON output from the extraction contains structured attributes. For more details on the extracted JSON structure and attributes, refer to the [JSON Attributes](https://help.syncfusion.com/document-processing/data-extraction/smart-data-extractor/net/overview#json-output-structure-and-attributes) documentation.
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/NET/conversions/pdf-to-markdown.md b/Document-Processing/Data-Extraction/NET/conversions/pdf-to-markdown.md
new file mode 100644
index 0000000000..2156093d70
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/conversions/pdf-to-markdown.md
@@ -0,0 +1,213 @@
+---
+title: Extract PDF to Markdown in C# | Smart Data Extractor | Syncfusion
+description: Extract PDF documents as Markdown (MD) in C# using Syncfusion® Smart Data Extractor library without Microsoft Office or Adobe dependencies
+platform: document-processing
+control: SmartDataExtractor
+documentation: UG
+keywords: Assemblies
+---
+
+# PDF to Markdown Extraction
+
+Markdown is a lightweight markup language that adds formatting elements to plain text documents. The Syncfusion® Smart Data Extractor library extracts structured information from PDF documents and scanned images, and outputs the content as Markdown (MD). It analyzes text blocks, tables, headers, and form fields to preserve layout and formatting.
+
+## Assemblies and NuGet packages required
+
+Refer to the following links for assemblies and NuGet packages required based on platforms to Extract data as Markdown file using the .NET Word Library (DocIO).
+
+* [PDF to Markdown Extraction assemblies](/document-processing/data-extraction/net/Assemblies-required)
+* [PDF to Markdown Extraction NuGet packages](/document-processing/data-extraction/net/Nuget-packages-required)
+
+## Extract Data as Markdown from PDF or Image
+
+To extract form fields across a PDF document using the [ExtractDataAsMarkdown](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsMarkdown_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
+
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" playgroundButtonLink="https://raw.githubusercontent.com/SyncfusionExamples/PDF-Examples/refs/heads/master/Data-Extraction/Smart-Data-Extractor/Extract-data-as-MD-from-PDF/.NET/Extract-data-as-MD-from-PDF/Program.cs" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as Markdown.
+ string data = extractor.ExtractDataAsMarkdown(stream);
+ //Save the extracted Markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% highlight c# tabtitle="C# [Windows-specific]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as Markdown.
+ string data = extractor.ExtractDataAsMarkdown(stream);
+ //Save the extracted Markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+N> If you want to extract data from an image instead of a PDF, replace the input stream with the image file (for example, Input.jpg or Input.png). The rest of the code remains unchanged.
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Extract-data-as-MD-from-PDF/.NET).
+
+## Extract a specific page to Markdown
+
+The following code demonstrates how to use the [ExtractDataAsMarkdown](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsMarkdown_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class to extract content from a selected page in a PDF and save it as a Markdown file by specifying its page index.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Set the page index for extraction (example: page 2).
+ extractor.PageRange = new int[,] { { 2, 2 } };
+ //Extract data as Markdown using the API.
+ string data = extractor.ExtractDataAsMarkdown(stream);
+ //Save the extracted Markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% highlight c# tabtitle="C# [Windows-specific]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Set the page index for extraction (example: page 2).
+ extractor.PageRange = new int[,] { { 2, 2 } };
+ //Extract data as Markdown using the API.
+ string data = extractor.ExtractDataAsMarkdown(stream);
+ //Save the extracted Markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
+}
+
+
+{% endhighlight %}
+
+{% endtabs %}
+
+## Extract a range of pages to Markdown
+
+The following code demonstrates how to use the [ExtractDataAsMarkdown](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsMarkdown_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class to extract content from a range of pages in a PDF and save it as a Markdown file by specifying the page range.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Set the page range for extraction (pages 1 to 3).
+ extractor.PageRange = new int[,] { { 1, 3 } };
+ //Extract data as Markdown using the API.
+ string data = extractor.ExtractDataAsMarkdown(stream);
+ //Save the extracted Markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% highlight c# tabtitle="C# [Windows-specific]" %}
+
+using System.IO;
+using Syncfusion.SmartDataExtractor;
+using System.Text;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Set the page range for extraction (pages 1 to 3).
+ extractor.PageRange = new int[,] { { 1, 3 } };
+ //Extract data as Markdown using the API.
+ string data = extractor.ExtractDataAsMarkdown(stream);
+ //Save the extracted Markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+## PDF to Markdown Preservation Mapping
+
+This section explains how common PDF elements are converted and preserved in Markdown format, ensuring that document structure and formatting remain consistent during the PDF to Markdown conversion process.
+
+
+
+
+ PDF Elements
+ Preservation in Markdown
+
+
+
+
+ Header, Paragraph Title, Document Title
+ Headings (H2)
+
+
+ Paragraph
+ Paragraph
+
+
+ Image
+ Image (base64 string)
+
+
+ Table
+ Table
+
+
+ Text Inline Styles
+ Bold and Italic
+
+
+ Link text without title text
+ Links
+
+
+ Code blocks, Footer, Page Number, List, Block quotes, Subscript, Superscript
+ Text
+
+
+
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/data-extraction-images/onnx-table.png b/Document-Processing/Data-Extraction/NET/data-extraction-images/onnx-table.png
similarity index 100%
rename from Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/data-extraction-images/onnx-table.png
rename to Document-Processing/Data-Extraction/NET/data-extraction-images/onnx-table.png
diff --git a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/data-extraction-images/onnx.png b/Document-Processing/Data-Extraction/NET/data-extraction-images/onnx.png
similarity index 100%
rename from Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/data-extraction-images/onnx.png
rename to Document-Processing/Data-Extraction/NET/data-extraction-images/onnx.png
diff --git a/Document-Processing/Data-Extraction/NET/data-extractor-overview.md b/Document-Processing/Data-Extraction/NET/data-extractor-overview.md
new file mode 100644
index 0000000000..482a056e2c
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/data-extractor-overview.md
@@ -0,0 +1,28 @@
+---
+title: Data Extraction and Conversion in .NET | Syncfusion
+description: Syncfusion Data Extraction is a .NET library that extracts tables, forms, text, and images from PDF or image files, and outputs JSON or Markdown.
+platform: document-processing
+control: DataExtraction
+documentation: UG
+keywords: Assemblies
+---
+
+# Welcome to Syncfusion Smart Data Extractor Library
+
+{% doccards %}
+
+{% doccard text="ASP.NET Core" link="/document-processing/data-extraction/net/Extract-Data-in-ASP-NET-Core" icon="/document-processing/data-extraction/net/images/ASP.NET-Core.png" %}
+
+{% doccard text="ASP.NET MVC" link="/document-processing/data-extraction/net/Extract-Data-in-ASP-NET-MVC" icon="/document-processing/data-extraction/net/images/ASP.NET-Mvc.svg" %}
+
+{% doccard text="Blazor" link="/document-processing/data-extraction/net/Extract-Data-in-Blazor" icon="/document-processing/data-extraction/net/images/Blazor.svg" %}
+
+{% doccard text="Windows Forms" link="/document-processing/data-extraction/net/Extract-Data-in-Windows-Forms" icon="/document-processing/data-extraction/net/images/Windows-Forms.svg" %}
+
+{% doccard text="WPF" link="/document-processing/data-extraction/net/Extract-Data-in-WPF" icon="/document-processing/data-extraction/net/images/WPF.svg" %}
+
+{% doccard text=".NET MAUI" link="/document-processing/data-extraction/net/Extract-Data-in-MAUI" icon="/document-processing/data-extraction/net/images/MAUI.svg" %}
+
+{% enddoccards %}
+
+Syncfusion® **Smart Data Extractor** is a robust, deterministic C# library built to capture complete document structures from PDFs and images. Optimized for .NET workflows, it interprets visual layout patterns—lines, boxes, labels, and alignment—to accurately identify and extract *tables, text blocks, images, headers, footers, and form fields*. Each extracted element includes per‑field confidence scores, enabling precise validation, seamless export, and smooth integration into applications.
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/NET/how-to-download-and-install.md b/Document-Processing/Data-Extraction/NET/how-to-download-and-install.md
new file mode 100644
index 0000000000..15fb57d637
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/how-to-download-and-install.md
@@ -0,0 +1,27 @@
+---
+layout: post
+title: Installing Syncfusion Data Extraction - Syncfusion
+description: Learn how to install the Syncfusion Smart Data Extractor library for extracting structured data from PDFs and images in .NET applications.
+platform: document-processing
+control: Installation and Deployment
+documentation: ug
+
+---
+
+# Download Syncfusion® Data Extraction Add-On
+
+The Syncfusion® Data Extraction Add-On can be downloaded from the [Syncfusion download page](https://www.syncfusion.com/downloads).
+
+
+### Download Data Extraction Add-On Setup
+
+1. You can evaluate our Data Extraction Add-On by visiting the [Download](https://www.syncfusion.com/downloads) page and selecting the product.
+2. An unlock key is not required to download the Syncfusion® Data Extraction Add-On.
+3. You can download the Data Extraction Add-On at any time from your registered account’s [Trials & Downloads](https://www.syncfusion.com/account/manage-trials/downloads) page by clicking the More Download Options (as shown in the screenshot below).
+
+
+
+4. The Syncfusion Data Extraction Add-On is provided in ZIP format. After downloading, extract the file to access assemblies and demos for PDF and image data extraction.
+ 
+
+N> The Syncfusion Data Extraction Add‑On is available in ZIP format for Windows, Linux, and Mac. Extract the file to access the assemblies and demos for PDF or image data extraction.
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/NET/images/ASP.NET-Core.png b/Document-Processing/Data-Extraction/NET/images/ASP.NET-Core.png
new file mode 100644
index 0000000000..5e1e5142b4
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/ASP.NET-Core.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/ASP.NET-Mvc.svg b/Document-Processing/Data-Extraction/NET/images/ASP.NET-Mvc.svg
new file mode 100644
index 0000000000..b78c550b18
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/ASP.NET-Mvc.svg
@@ -0,0 +1,10 @@
+
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/AWS.svg b/Document-Processing/Data-Extraction/NET/images/AWS.svg
new file mode 100644
index 0000000000..b49d99d702
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/AWS.svg
@@ -0,0 +1,6 @@
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/Azure.svg b/Document-Processing/Data-Extraction/NET/images/Azure.svg
new file mode 100644
index 0000000000..beed989fc2
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/Azure.svg
@@ -0,0 +1,4 @@
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/Blazor.svg b/Document-Processing/Data-Extraction/NET/images/Blazor.svg
new file mode 100644
index 0000000000..88a978c9b8
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/Blazor.svg
@@ -0,0 +1,4 @@
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/Docker.svg b/Document-Processing/Data-Extraction/NET/images/Docker.svg
new file mode 100644
index 0000000000..8665c73d83
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/Docker.svg
@@ -0,0 +1,11 @@
+
+
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/Flutter.svg b/Document-Processing/Data-Extraction/NET/images/Flutter.svg
new file mode 100644
index 0000000000..be7ab0f6d1
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/Flutter.svg
@@ -0,0 +1,24 @@
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/Google-Cloud.svg b/Document-Processing/Data-Extraction/NET/images/Google-Cloud.svg
new file mode 100644
index 0000000000..0c41b6c56d
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/Google-Cloud.svg
@@ -0,0 +1,14 @@
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/Installer1.png b/Document-Processing/Data-Extraction/NET/images/Installer1.png
new file mode 100644
index 0000000000..2ca55e97f4
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/Installer1.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/Installer2.png b/Document-Processing/Data-Extraction/NET/images/Installer2.png
new file mode 100644
index 0000000000..5c6b8bcb5e
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/Installer2.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/Javascript.svg b/Document-Processing/Data-Extraction/NET/images/Javascript.svg
new file mode 100644
index 0000000000..fa8d58507a
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/Javascript.svg
@@ -0,0 +1,18 @@
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/Linux.png b/Document-Processing/Data-Extraction/NET/images/Linux.png
new file mode 100644
index 0000000000..30269d44c4
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/Linux.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/MAUI.svg b/Document-Processing/Data-Extraction/NET/images/MAUI.svg
new file mode 100644
index 0000000000..bf68001e5f
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/MAUI.svg
@@ -0,0 +1,15 @@
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/Mac.svg b/Document-Processing/Data-Extraction/NET/images/Mac.svg
new file mode 100644
index 0000000000..2715a8a4ca
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/Mac.svg
@@ -0,0 +1,10 @@
+
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/UWP.svg b/Document-Processing/Data-Extraction/NET/images/UWP.svg
new file mode 100644
index 0000000000..1d3b611288
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/UWP.svg
@@ -0,0 +1,10 @@
+
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/WPF.svg b/Document-Processing/Data-Extraction/NET/images/WPF.svg
new file mode 100644
index 0000000000..34262fd5f9
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/WPF.svg
@@ -0,0 +1,9 @@
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/WinUI.png b/Document-Processing/Data-Extraction/NET/images/WinUI.png
new file mode 100644
index 0000000000..404b341b0b
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/WinUI.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/Windows-Forms.svg b/Document-Processing/Data-Extraction/NET/images/Windows-Forms.svg
new file mode 100644
index 0000000000..340aa8b3fb
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/images/Windows-Forms.svg
@@ -0,0 +1,9 @@
+
+
+
+
+
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/images/license-download.png b/Document-Processing/Data-Extraction/NET/images/license-download.png
new file mode 100644
index 0000000000..f3d1d37f61
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/license-download.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/start-trial-download-installer.png b/Document-Processing/Data-Extraction/NET/images/start-trial-download-installer.png
new file mode 100644
index 0000000000..deb5451712
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/start-trial-download-installer.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/start-trial-download-offline-installer.png b/Document-Processing/Data-Extraction/NET/images/start-trial-download-offline-installer.png
new file mode 100644
index 0000000000..ce3ec6bc93
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/start-trial-download-offline-installer.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/start-trial-download.png b/Document-Processing/Data-Extraction/NET/images/start-trial-download.png
new file mode 100644
index 0000000000..301ac6d14c
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/start-trial-download.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/trial-confirmation.png b/Document-Processing/Data-Extraction/NET/images/trial-confirmation.png
new file mode 100644
index 0000000000..841e88c60e
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/trial-confirmation.png differ
diff --git a/Document-Processing/Data-Extraction/NET/images/trial-download.png b/Document-Processing/Data-Extraction/NET/images/trial-download.png
new file mode 100644
index 0000000000..29eaf051df
Binary files /dev/null and b/Document-Processing/Data-Extraction/NET/images/trial-download.png differ
diff --git a/Document-Processing/Data-Extraction/NET/ocr-overview.md b/Document-Processing/Data-Extraction/NET/ocr-overview.md
new file mode 100644
index 0000000000..3259cc826a
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/ocr-overview.md
@@ -0,0 +1,36 @@
+---
+title: Intro to OCR Processor | Syncfusion
+description: This page introduces the Syncfusion OCR Processor, highlighting its purpose, main features, and how to begin optical character recognition in .NET apps.
+platform: document-processing
+control: OCRProcessor
+documentation: UG
+keywords: OCR, Optical Character Recognition, Text Recognition
+---
+
+# Welcome to Syncfusion OCR Processor Library
+
+{% doccards %}
+
+{% doccard text="ASP.NET Core" link="/document-processing/data-extraction/net/ocr-processor/net-core" icon="/document-processing/data-extraction/net/images/ASP.NET-Core.png" %}
+
+{% doccard text="ASP.NET MVC" link="/document-processing/data-extraction/net/ocr-processor/aspnet-mvc" icon="/document-processing/data-extraction/net/images/ASP.NET-Mvc.svg" %}
+
+{% doccard text="Blazor" link="/document-processing/data-extraction/net/ocr-processor/blazor" icon="/document-processing/data-extraction/net/images/Blazor.svg" %}
+
+{% doccard text="Windows Forms" link="/document-processing/data-extraction/net/ocr-processor/Windows-Forms" icon="/document-processing/data-extraction/net/images/Windows-Forms.svg" %}
+
+{% doccard text="WPF" link="/document-processing/data-extraction/net/ocr-processor/WPF" icon="/document-processing/data-extraction/net/images/WPF.svg" %}
+
+{% doccard text="Linux" link="/document-processing/data-extraction/net/ocr-processor/Linux" icon="/document-processing/data-extraction/net/images/Linux.png" %}
+
+{% doccard text="Mac" link="/document-processing/data-extraction/net/ocr-processor/MAC" icon="/document-processing/data-extraction/net/images/Mac.svg" %}
+
+{% doccard text="Azure" link="/document-processing/data-extraction/net/ocr-processor/azure" icon="/document-processing/data-extraction/net/images/Azure.svg" %}
+
+{% doccard text="AWS" link="/document-processing/data-extraction/net/ocr-processor/AWS-Textract" icon="/document-processing/data-extraction/net/images/AWS.svg" %}
+
+{% enddoccards %}
+
+The Syncfusion® OCR Processor is a high‑performance .NET library designed for accurate text recognition from scanned documents, images, and PDF files. Optimized for modern .NET workflows, it processes raster images and document pages to recognize printed text, analyze page layouts, and programmatically extract searchable content.
+
+This OCR Processor supports widely used document formats and provides a streamlined API for converting image‑based files into machine‑readable text. It is ideal for scenarios such as document digitization, full‑text search, content indexing, and enterprise data processing, enabling developers to integrate OCR functionality seamlessly into their applications.
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/OCR/NET/AWS-Textract.md b/Document-Processing/Data-Extraction/NET/ocr-processor/AWS-Textract.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/AWS-Textract.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/AWS-Textract.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Amazon-Linux-EC2-Setup-Guide.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Amazon-Linux-EC2-Setup-Guide.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/Amazon-Linux-EC2-Setup-Guide.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Amazon-Linux-EC2-Setup-Guide.md
diff --git a/Document-Processing/Data-Extraction/NET/ocr-processor/Assemblies-Required.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Assemblies-Required.md
new file mode 100644
index 0000000000..1bfee7e8c7
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/ocr-processor/Assemblies-Required.md
@@ -0,0 +1,63 @@
+---
+title: Assemblies Required for OCR | Syncfusion
+description: This section describes the required Syncfusion assemblies needed to integrate and use the OCR Processor effectively in your applications
+platform: document-processing
+control: PDF
+documentation: UG
+keywords: Assemblies
+---
+# Assemblies Required to work with OCR processor
+
+Get the following required assemblies by downloading the OCR library installer. Download and install the OCR library for Windows, Linux, and Mac respectively. Please refer to the advanced installation steps for more details.
+
+#### Syncfusion® OCR Processor Assemblies
+
+
+
+
+ Platform(s)
+ Assemblies
+
+
+
+
+
+ {{'WPF'| markdownify }},
+ {{'Windows Forms'| markdownify }},
+ {{'ASP.NET'| markdownify }} and {{'ASP.NET MVC'| markdownify }}
+
+
+ Syncfusion.OCRProcessor.Base
+ Syncfusion.Pdf.Base
+ Syncfusion.Compression.Base
+ Syncfusion.ImagePreProcessor.Base
+
+
+
+
+ {{'.NET Standard 2.0'| markdownify }}
+
+
+ Syncfusion.OCRProcessor.Portable
+ Syncfusion.Pdf.Imaging.Portable
+ Syncfusion.Pdf.Portable
+ Syncfusion.Compression.Portable
+ {{'[SkiaSharp](https://www.nuget.org/packages/SkiaSharp/3.119.1)'| markdownify }} package
+ Syncfusion.ImagePreProcessor.Portable
+
+
+
+
+ {{'.NET 8/.NET 9/.NET 10'| markdownify }}
+
+
+ Syncfusion.OCRProcessor.NET
+ Syncfusion.Pdf.Imaging.NET
+ Syncfusion.Pdf.NET
+ Syncfusion.Compression.NET
+ {{'[SkiaSharp](https://www.nuget.org/packages/SkiaSharp/3.119.1)'| markdownify }} package
+ Syncfusion.ImagePreProcessor.NET
+
+
+
+
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Azure-Kubernetes-Service.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Azure-Kubernetes-Service.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/Azure-Kubernetes-Service.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Azure-Kubernetes-Service.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Azure-Vision.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Azure-Vision.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/Azure-Vision.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Azure-Vision.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Docker.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Docker.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/Docker.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Docker.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Features.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Features.md
similarity index 99%
rename from Document-Processing/Data-Extraction/OCR/NET/Features.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Features.md
index 57782a9569..33d897c5bb 100644
--- a/Document-Processing/Data-Extraction/OCR/NET/Features.md
+++ b/Document-Processing/Data-Extraction/NET/ocr-processor/Features.md
@@ -150,13 +150,13 @@ End Using
You can downloaded a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/OCR/.NET/Perform-OCR-on-particular-region-of-PDF-document).
-## Performing OCR with tesseract version 3.05
-The [TesseractVersion](https://help.syncfusion.com/cr/document-processing/Syncfusion.OCRProcessor.OCRSettings.html#Syncfusion_OCRProcessor_OCRSettings_TesseractVersion) property is used to switch the tesseract version between 3.02 and 3.05. By default, OCR works with tesseract version 5.0.
-N> The starting supported version of tesseract in ASP.NET Core is 4.0. So the lower tesseract versions 3.02 and 3.05 are not supported and we don't have the property called ``TesseractVersion`` in ASP.NET Core platform.
+## Performing OCR with Tesseract Version 5.0
-The following code sample demonstrates the OCR processor with Tesseract version 3.05 for PDF documents.
+The [TesseractVersion](https://help.syncfusion.com/cr/document-processing/Syncfusion.OCRProcessor.OCRSettings.html#Syncfusion_OCRProcessor_OCRSettings_TesseractVersion) property is used to switch the tesseract version to 5.0. By default, OCR will be performed with tesseract version 5.0.
+
+The following code sample explains the OCR processor with Tesseract version 5.0 for PDF documents.
{% tabs %}
@@ -174,7 +174,7 @@ using (OCRProcessor processor = new OCRProcessor())
//Set OCR language.
processor.Settings.Language = Languages.English;
//Set tesseract OCR Engine.
- processor.Settings.TesseractVersion = TesseractVersion.Version3_05;
+ processor.Settings.TesseractVersion = TesseractVersion.Version5_0;
//Perform OCR with input document, tessdata (Language packs) and enabling isMemoryOptimized property.
processor.PerformOCR(document);
@@ -192,27 +192,26 @@ Imports Syncfusion.OCRProcessor
Imports Syncfusion.Pdf.Parsing
'Initialize the OCR processor with tesseract binaries folder path.
-Using processor As OCRProcessor = New OCRProcessor("TesseractBinaries/3.05/")
+Using processor As OCRProcessor = New OCRProcessor("TesseractBinaries/5.0/")
'Load an existing PDF document.
Dim document As PdfLoadedDocument = New PdfLoadedDocument("Input.pdf")
'Set OCR language.
processor.Settings.Language = Languages.English
'Set tesseract OCR Engine.
- processor.Settings.TesseractVersion = TesseractVersion.Version3_05
+ processor.Settings.TesseractVersion = TesseractVersion.Version5_0
'Perform OCR with input document, tessdata (Language packs) and enabling isMemoryOptimized property.
processor.PerformOCR(document)
-
'Save the PDF document.
document.Save("Output.pdf")
'Close the document.
document.Close(True)
End Using
-
+
{% endhighlight %}
-{% endtabs %}
+{% endtabs %}
## Performing OCR with Tesseract Version 4.0
@@ -275,11 +274,13 @@ End Using
{% endtabs %}
-## Performing OCR with Tesseract Version 5.0
+## Performing OCR with tesseract version 3.05
-The [TesseractVersion](https://help.syncfusion.com/cr/document-processing/Syncfusion.OCRProcessor.OCRSettings.html#Syncfusion_OCRProcessor_OCRSettings_TesseractVersion) property is used to switch the tesseract version to 5.0. By default, OCR will be performed with tesseract version 5.0.
+The [TesseractVersion](https://help.syncfusion.com/cr/document-processing/Syncfusion.OCRProcessor.OCRSettings.html#Syncfusion_OCRProcessor_OCRSettings_TesseractVersion) property is used to switch the tesseract version between 3.02 and 3.05. By default, OCR works with tesseract version 5.0.
-The following code sample explains the OCR processor with Tesseract version 5.0 for PDF documents.
+N> The starting supported version of tesseract in ASP.NET Core is 4.0. So the lower tesseract versions 3.02 and 3.05 are not supported and we don't have the property called ``TesseractVersion`` in ASP.NET Core platform.
+
+The following code sample demonstrates the OCR processor with Tesseract version 3.05 for PDF documents.
{% tabs %}
@@ -297,7 +298,7 @@ using (OCRProcessor processor = new OCRProcessor())
//Set OCR language.
processor.Settings.Language = Languages.English;
//Set tesseract OCR Engine.
- processor.Settings.TesseractVersion = TesseractVersion.Version5_0;
+ processor.Settings.TesseractVersion = TesseractVersion.Version3_05;
//Perform OCR with input document, tessdata (Language packs) and enabling isMemoryOptimized property.
processor.PerformOCR(document);
@@ -315,26 +316,27 @@ Imports Syncfusion.OCRProcessor
Imports Syncfusion.Pdf.Parsing
'Initialize the OCR processor with tesseract binaries folder path.
-Using processor As OCRProcessor = New OCRProcessor("TesseractBinaries/5.0/")
+Using processor As OCRProcessor = New OCRProcessor("TesseractBinaries/3.05/")
'Load an existing PDF document.
Dim document As PdfLoadedDocument = New PdfLoadedDocument("Input.pdf")
'Set OCR language.
processor.Settings.Language = Languages.English
'Set tesseract OCR Engine.
- processor.Settings.TesseractVersion = TesseractVersion.Version5_0
+ processor.Settings.TesseractVersion = TesseractVersion.Version3_05
'Perform OCR with input document, tessdata (Language packs) and enabling isMemoryOptimized property.
processor.PerformOCR(document)
+
'Save the PDF document.
document.Save("Output.pdf")
'Close the document.
document.Close(True)
End Using
-
+
{% endhighlight %}
-{% endtabs %}
+{% endtabs %}
## Performing OCR on image
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Getting-started-overview.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Getting-started-overview.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/Getting-started-overview.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Getting-started-overview.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Linux.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Linux.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/Linux.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Linux.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/MAC.md b/Document-Processing/Data-Extraction/NET/ocr-processor/MAC.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/MAC.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/MAC.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/NuGet-Packages-Required.md b/Document-Processing/Data-Extraction/NET/ocr-processor/NuGet-Packages-Required.md
similarity index 77%
rename from Document-Processing/Data-Extraction/OCR/NET/NuGet-Packages-Required.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/NuGet-Packages-Required.md
index 84326ae80d..08292135b6 100644
--- a/Document-Processing/Data-Extraction/OCR/NET/NuGet-Packages-Required.md
+++ b/Document-Processing/Data-Extraction/NET/ocr-processor/NuGet-Packages-Required.md
@@ -5,9 +5,9 @@ platform: document-processing
control: PDF
documentation: UG
---
-# NuGet Packages Required for OCR processor
+## NuGet Packages Required for OCR Processor
-Directly install the NuGet package to your application from [nuget.org](https://www.nuget.org/).
+To work with the OCR Processor, the following NuGet packages need to be installed in your application from [nuget.org](https://www.nuget.org/).
@@ -48,7 +48,7 @@ Console Application (Targeting .NET Core)
Blazor
-{{'[Syncfusion.PDF.OCR.Net.Core](https://www.nuget.org/packages/Syncfusion.PDF.OCR.Net.Core)'| markdownify }}
+{{'[Syncfusion.Pdf.OCR.Net.Core](https://www.nuget.org/packages/Syncfusion.Pdf.OCR.Net.Core)'| markdownify }}
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Apply-docker-aks.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Apply-docker-aks.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Apply-docker-aks.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Apply-docker-aks.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions10.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions10.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions10.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions10.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions11.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions11.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions11.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions11.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions12.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions12.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions12.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions12.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions13.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions13.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions13.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions13.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions3.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions3.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions3.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions3.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions4.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions4.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions4.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions4.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions5.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions5.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions5.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions5.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions7.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions7.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions7.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions7.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions8.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions8.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions8.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions8.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions9.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions9.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/AzureFunctions9.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/AzureFunctions9.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Azure_configuration_window1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Azure_configuration_window1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Azure_configuration_window1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Azure_configuration_window1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Blazor-Server-App-JetBrains.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Blazor-Server-App-JetBrains.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Blazor-Server-App-JetBrains.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Blazor-Server-App-JetBrains.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Button-docker-aks.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Button-docker-aks.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Button-docker-aks.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Button-docker-aks.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Core_sample_creation_step1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Core_sample_creation_step1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Core_sample_creation_step1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Core_sample_creation_step1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Core_sample_creation_step2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Core_sample_creation_step2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Core_sample_creation_step2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Core_sample_creation_step2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Core_sample_creation_step3.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Core_sample_creation_step3.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Core_sample_creation_step3.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Core_sample_creation_step3.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Core_sample_creation_step4.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Core_sample_creation_step4.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Core_sample_creation_step4.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Core_sample_creation_step4.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Deploy-docker-aks.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Deploy-docker-aks.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Deploy-docker-aks.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Deploy-docker-aks.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Deployment_type.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Deployment_type.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Deployment_type.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Deployment_type.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Docker_file_commends.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Docker_file_commends.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Docker_file_commends.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Docker_file_commends.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Install-Blazor-JetBrains-Package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Install-Blazor-JetBrains-Package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Install-Blazor-JetBrains-Package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Install-Blazor-JetBrains-Package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Install-MVC-Package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Install-MVC-Package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Install-MVC-Package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Install-MVC-Package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Install-leptonica.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Install-leptonica.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Install-leptonica.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Install-leptonica.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Install-tesseract.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Install-tesseract.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Install-tesseract.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Install-tesseract.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/JetBrains-Package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/JetBrains-Package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/JetBrains-Package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/JetBrains-Package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep3.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep3.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep3.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep3.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep4.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep4.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep4.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep4.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep5.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep5.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/LinuxStep5.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/LinuxStep5.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Mac_OS_Console.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Mac_OS_Console.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Mac_OS_Console.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Mac_OS_Console.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Mac_OS_NuGet_path.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Mac_OS_NuGet_path.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Mac_OS_NuGet_path.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Mac_OS_NuGet_path.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-Azure-step1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-Azure-step1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-Azure-step1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-Azure-step1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-Azure-step2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-Azure-step2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-Azure-step2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-Azure-step2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-Azure-step3.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-Azure-step3.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-Azure-step3.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-Azure-step3.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-Azure-step4.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-Azure-step4.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-Azure-step4.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-Azure-step4.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-creation-step1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-creation-step1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-creation-step1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-creation-step1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-creation-step2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-creation-step2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-creation-step2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-creation-step2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-creation-step3.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-creation-step3.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-creation-step3.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-creation-step3.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-creation-step4.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-creation-step4.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/NET-sample-creation-step4.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/NET-sample-creation-step4.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-ASPNET-Step1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-ASPNET-Step1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-ASPNET-Step1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-ASPNET-Step1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-ASPNET-Step2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-ASPNET-Step2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-ASPNET-Step2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-ASPNET-Step2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-ASPNET-Step3.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-ASPNET-Step3.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-ASPNET-Step3.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-ASPNET-Step3.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-ASPNET-Step4.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-ASPNET-Step4.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-ASPNET-Step4.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-ASPNET-Step4.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Core-NuGet-package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Core-NuGet-package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Core-NuGet-package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Core-NuGet-package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Core-app-creation.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Core-app-creation.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Core-app-creation.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Core-app-creation.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Core-project-configuration1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Core-project-configuration1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Core-project-configuration1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Core-project-configuration1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Core-project-configuration2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Core-project-configuration2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Core-project-configuration2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Core-project-configuration2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Docker-NuGet-package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Docker-NuGet-package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-Docker-NuGet-package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-Docker-NuGet-package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-MVC-NuGet-package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-MVC-NuGet-package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-MVC-NuGet-package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-MVC-NuGet-package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-MVC-app-creation.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-MVC-app-creation.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-MVC-app-creation.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-MVC-app-creation.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-MVC-project-configuration1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-MVC-project-configuration1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-MVC-project-configuration1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-MVC-project-configuration1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-MVC-project-configuration2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-MVC-project-configuration2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-MVC-project-configuration2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-MVC-project-configuration2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-NET-step1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-NET-step1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-NET-step1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-NET-step1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-NET-step2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-NET-step2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-NET-step2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-NET-step2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-NET-step3.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-NET-step3.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-NET-step3.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-NET-step3.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WF-NuGet-package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WF-NuGet-package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WF-NuGet-package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WF-NuGet-package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WF-app-creation.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WF-app-creation.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WF-app-creation.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WF-app-creation.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WF-configuraion-window.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WF-configuraion-window.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WF-configuraion-window.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WF-configuraion-window.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WPF-NuGet-package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WPF-NuGet-package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WPF-NuGet-package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WPF-NuGet-package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WPF-app-creation.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WPF-app-creation.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WPF-app-creation.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WPF-app-creation.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WPF-project-configuration.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WPF-project-configuration.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-WPF-project-configuration.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-WPF-project-configuration.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-command-aks.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-command-aks.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-command-aks.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-command-aks.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-docker-configuration-window.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-docker-configuration-window.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-docker-configuration-window.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-docker-configuration-window.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-output-image.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-output-image.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR-output-image.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR-output-image.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCRDocker1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCRDocker1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCRDocker1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCRDocker1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCRDocker6.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCRDocker6.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCRDocker6.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCRDocker6.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR_docker_target.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR_docker_target.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/OCR_docker_target.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/OCR_docker_target.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Output-genrate-webpage.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Output-genrate-webpage.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Output-genrate-webpage.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Output-genrate-webpage.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Output.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Output.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Output.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Output.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Push-docker-aks.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Push-docker-aks.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Push-docker-aks.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Push-docker-aks.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Redistributable-file.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Redistributable-file.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Redistributable-file.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Redistributable-file.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Service-docker-aks.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Service-docker-aks.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Service-docker-aks.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Service-docker-aks.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Set_Copy_Always.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Set_Copy_Always.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Set_Copy_Always.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Set_Copy_Always.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Tag-docker-image.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Tag-docker-image.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Tag-docker-image.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Tag-docker-image.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Tessdata-path.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Tessdata-path.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Tessdata-path.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Tessdata-path.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/TessdataRemove.jpeg b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/TessdataRemove.jpeg
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/TessdataRemove.jpeg
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/TessdataRemove.jpeg
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Tessdata_Store.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Tessdata_Store.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/Tessdata_Store.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/Tessdata_Store.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/WF_sample_creation_step1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/WF_sample_creation_step1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/WF_sample_creation_step1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/WF_sample_creation_step1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/WF_sample_creation_step2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/WF_sample_creation_step2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/WF_sample_creation_step2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/WF_sample_creation_step2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_NuGet_package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_NuGet_package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_NuGet_package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_NuGet_package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_additional_information.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_additional_information.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_additional_information.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_additional_information.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step10.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step10.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step10.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step10.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step11.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step11.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step11.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step11.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step12.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step12.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step12.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step12.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step13.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step13.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step13.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step13.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step5.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step5.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step5.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step5.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step6.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step6.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step6.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step6.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step7.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step7.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step7.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step7.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step8.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step8.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step8.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step8.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step9.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step9.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/azure_step9.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/azure_step9.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_nuget_package.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_nuget_package.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_nuget_package.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_nuget_package.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_server_app_creation.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_server_app_creation.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_server_app_creation.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_server_app_creation.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_server_broswer_window.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_server_broswer_window.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_server_broswer_window.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_server_broswer_window.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_server_configuration1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_server_configuration1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_server_configuration1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_server_configuration1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_server_configuration2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_server_configuration2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/blazor_server_configuration2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/blazor_server_configuration2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/create-asp.net-core-application.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/create-asp.net-core-application.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/create-asp.net-core-application.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/create-asp.net-core-application.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/launch-jetbrains-rider.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/launch-jetbrains-rider.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/launch-jetbrains-rider.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/launch-jetbrains-rider.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step1.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step1.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step1.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step1.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step2.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step2.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step2.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step2.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step3.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step3.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step3.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step3.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step4.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step4.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step4.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step4.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step5.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step5.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step5.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step5.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step6.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step6.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step6.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step6.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step7.png b/Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step7.png
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/OCR-Images/mac_step7.png
rename to Document-Processing/Data-Extraction/NET/ocr-processor/OCR-Images/mac_step7.png
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Troubleshooting.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Troubleshooting.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/Troubleshooting.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Troubleshooting.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/WPF.md b/Document-Processing/Data-Extraction/NET/ocr-processor/WPF.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/WPF.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/WPF.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Windows-Forms.md b/Document-Processing/Data-Extraction/NET/ocr-processor/Windows-Forms.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/Windows-Forms.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/Windows-Forms.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/aspnet-mvc.md b/Document-Processing/Data-Extraction/NET/ocr-processor/aspnet-mvc.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/aspnet-mvc.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/aspnet-mvc.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/azure.md b/Document-Processing/Data-Extraction/NET/ocr-processor/azure.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/azure.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/azure.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/blazor.md b/Document-Processing/Data-Extraction/NET/ocr-processor/blazor.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/blazor.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/blazor.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/how-to-perform-ocr-for-a-pdf-document-using-cSharp-and-VB.md b/Document-Processing/Data-Extraction/NET/ocr-processor/how-to-perform-ocr-for-a-pdf-document-using-cSharp-and-VB.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/how-to-perform-ocr-for-a-pdf-document-using-cSharp-and-VB.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/how-to-perform-ocr-for-a-pdf-document-using-cSharp-and-VB.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/how-to-perform-ocr-for-a-pdf-document-using-net-Core.md b/Document-Processing/Data-Extraction/NET/ocr-processor/how-to-perform-ocr-for-a-pdf-document-using-net-Core.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/how-to-perform-ocr-for-a-pdf-document-using-net-Core.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/how-to-perform-ocr-for-a-pdf-document-using-net-Core.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/net-core.md b/Document-Processing/Data-Extraction/NET/ocr-processor/net-core.md
similarity index 100%
rename from Document-Processing/Data-Extraction/OCR/NET/net-core.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/net-core.md
diff --git a/Document-Processing/Data-Extraction/OCR/NET/overview.md b/Document-Processing/Data-Extraction/NET/ocr-processor/overview.md
similarity index 98%
rename from Document-Processing/Data-Extraction/OCR/NET/overview.md
rename to Document-Processing/Data-Extraction/NET/ocr-processor/overview.md
index bf3739533d..fa050f0c11 100644
--- a/Document-Processing/Data-Extraction/OCR/NET/overview.md
+++ b/Document-Processing/Data-Extraction/NET/ocr-processor/overview.md
@@ -43,5 +43,5 @@ N> Starting with v20.1.0.x, if you reference Syncfusion® OCR proc
* Image into searchable PDF or PDF/A.
* Improved accessibility.
* Post-processing.
-* Compatible with .NET Framework 4.6.2 and above.
+* Compatible with .NET Framework 4.5 and above.
* Compatible with .NET Core 2.0 and above.
diff --git a/Document-Processing/Data-Extraction/NET/overview.md b/Document-Processing/Data-Extraction/NET/overview.md
new file mode 100644
index 0000000000..4a0026e85c
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/overview.md
@@ -0,0 +1,286 @@
+---
+title: Extract Structured Data in .NET | Syncfusion
+description: Syncfusion® Data Extraction is a .NET library that extracts tables, forms, text, and images from documents, outputing PDFs and JSON.
+platform: document-processing
+control: DataExtraction
+documentation: UG
+keywords: Assemblies
+---
+
+# Overview of Smart Data Extractor
+
+Syncfusion® **Smart Data Extractor** is a high‑performance, deterministic C# library for extracting structured document content from **PDFs** and **images**. Tailored for modern .NET workflows, it interprets visual layout patterns—lines, boxes, labels, and alignment—to accurately identify and extract **tables**, **text elements**, **images**, **headers**, **footers**, and **form fields**. Each extracted element includes per‑field confidence scores, ensuring reliable validation, seamless export, and smooth integration into applications.
+
+## Key Features of Syncfusion® Smart Data Extractor
+
+The following list highlights the core capabilities of the Syncfusion® Smart Data Extractor:
+
+* **Document structure extraction:** detects text elements, images, headers/footers, and complete table structures (regions, header rows, columns, cell boundaries, merged cells).
+* **File format support:** works with PDF and common image formats such as JPEG and PNG.
+* **Table extraction:** specialized parsing to recover table rows, columns, header detection, and cell spans.
+* **Form recognition:** detects and extracts form fields (text inputs, checkboxes, radio buttons) with field types and values.
+* **Page‑level control:** extract data from specific pages or defined page ranges.
+* **Confidence thresholding:** filters results based on a configurable confidence score (0.0–1.0).
+* **Deterministic performance:** ensures predictable, repeatable extraction across environments including Windows, Linux, Azure, and Docker.
+
+## JSON Output Structure and Attributes
+
+The Syncfusion® Data Extraction libraries process PDFs and scanned images to extract structured document data—including tables, form fields, text elements, images, headers, and footers—by analyzing layout patterns, table regions, borders, alignment cues, and cell structures. The extracted output is returned as structured JSON with per‑field and per‑cell confidence scores, along with complete document and table hierarchies, making it ready for immediate review, export, or integration into downstream workflows.
+
+### Root Structure
+
+Below is the root structure of the JSON result:
+
+{% tabs %}
+
+{% highlight json tabtitle="JSON" %}
+
+{
+ "Pages": [
+ {
+ "PageNumber": 1,
+ "Width": 0,
+ "Height": 0,
+ "PageObjects": [],
+ "FormObjects": []
+ }
+ ]
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+N> In the Smart Table Extractor root structure, the **FormObjects** element will not be present.
+
+### JSON Attributes
+
+#### Page Object
+
+The Page Object represents the metadata of a page along with all the detected elements it contains in the **Smart Data Extractor**, and the table elements it contains in the **Smart Table Extractor.**
+
+
+
+
+Attribute
+Type
+Description
+
+
+
+
+PageNumber
+Integer
+Sequential number of the page in the document.
+
+
+Width
+Float
+Page width in points/pixels.
+
+
+Height
+Float
+Page height in points/pixels.
+
+
+PageObjects
+Array
+List of detected objects (table).
+
+
+FormObjects
+Array
+List of detected form fields (checkboxes, text boxes, radio buttons, signatures etc.)
+
+
+
+
+N> The **FormObjects** array is not included in the **Smart Table Extractor** output structure, as it is specific to the **Smart Data Extractor** and **Smart Form Recognizer**.
+
+#### PageObjects
+
+PageObjects represent the metadata of a page along with the detected elements it contains—such as text, headers, footers, tables, images, and numbers—in the Smart Data Extractor, while in the Smart Table Extractor they represent the detected table elements on a page.
+
+
+
+
+Attribute
+Type
+Description
+
+
+
+
+Type
+String
+Defines the kind of object detected on the page (Table).
+
+
+Bounds
+Array of Floats
+The bounding box coordinates [X, Y, Width, Height] representing the object's position and size on the page.
+
+
+Content
+Object
+Holds the extracted textual content along with its style attributes (FontName, FontStyle, FontSize) that describe the appearance of the text.
+
+
+Confidence
+Float
+Confidence score (0–1) indicating the accuracy of detection.
+
+
+TableFormat (only for tables)
+Object
+Metadata about table detection, including detection score and label.
+
+
+Rows (only for tables)
+Array
+Collection of row objects that make up the table.
+
+
+
+
+#### Row Object
+
+The Row Object represents a single horizontal group of cells within a table, along with its bounding box.
+
+
+
+
+Attribute
+Type
+Description
+
+
+
+
+Type
+String
+Specifies the row type (for example, tr).
+
+
+Rect
+Array
+Bounding box coordinates for the row.
+
+
+Cells
+Array
+Collection of cell objects contained in the row.
+
+
+
+
+#### Cell Object
+
+The Cell Object represents an individual table entry, containing text values, spanning details, and positional coordinates.
+
+
+
+
+Attribute
+Type
+Description
+
+
+
+
+Type
+String
+Cell type (e.g., td).
+
+
+Rect
+Array
+Bounding box coordinates for the cell.
+
+
+RowSpan / ColSpan
+Integer
+Number of rows or columns spanned by the cell.
+
+
+RowStart / ColStart
+Integer
+Starting row and column index of the cell.
+
+
+Content.Value
+String
+Text content inside the cell.
+
+
+
+
+#### FormObjects
+
+FormObjects represent interactive form fields detected on the page, such as text boxes, checkboxes, radio buttons, and signature regions.Each object includes positional data, dimensions, field type, and a confidence score that indicates detection reliability.
+
+
+
+
+Attribute
+Type
+Description
+
+
+
+
+X / Y
+Float
+Coordinates of the form field on the page.
+
+
+Width / Height
+Float
+Dimensions of the form field.
+
+
+Type
+Integer
+Numeric identifier for the form field type (for example, 0 = TextArea, 1 = Checkbox, 2 = Radio Button, 3 = Signature).
+
+
+Confidence
+Float
+Confidence score (0–1) indicating detection accuracy.
+
+
+
+
+N> The **FormObjects** structure is not available in the Smart Table Extractor output.
+
+### Text Attribute
+
+Represents the text formatting attributes (font family, font style, font size) applied to the extracted text.
+
+
+
+Attribute
+Type
+Description
+
+
+
+
+FontName
+String
+Specifies the font family name used for the text (for example, "Arial").
+
+
+FontStyle
+Integer
+Specifies the numeric identifier for the font style (for example, 0 = Regular, 1 = Bold, 2 = Italic).
+
+
+
+FontSize
+Float
+Specifies the font size used for the text.
+
+
+
diff --git a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/troubleshooting.md b/Document-Processing/Data-Extraction/NET/troubleshooting.md
similarity index 81%
rename from Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/troubleshooting.md
rename to Document-Processing/Data-Extraction/NET/troubleshooting.md
index 72552ce853..b0435428a6 100644
--- a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/troubleshooting.md
+++ b/Document-Processing/Data-Extraction/NET/troubleshooting.md
@@ -1,12 +1,12 @@
---
-title: Troubleshoot SmartDataExtractor in DataExtractor | Syncfusion
-description: Troubleshooting steps and FAQs for Syncfusion SmartDataExtractor to resolve common errors in .NET Framework projects.
+title: Troubleshoot SmartDataExtractor and SmartTableExtractor in DataExtractor | Syncfusion
+description: Troubleshooting steps and FAQs for Syncfusion SmartDataExtractor and SmartTableExtractor to resolve common errors in .NET Framework projects.
platform: document-processing
-control: SmartDataExtractor
+control: SmartDataExtractor, SmartTableExtractor
documentation: UG
---
-# Troubleshooting and FAQ for Smart Data Extractor
+# Troubleshooting and FAQ for Smart Data Extractor and Smart Table Extractor
## ONNX file missing
@@ -17,16 +17,20 @@ documentation: UG
Reason
- The required ONNX model files are not copied into the application’s build output.
+ The required ONNX model files are not copied into the application's build output.
Solution
Ensure that the runtimes folder is copied properly to the bin folder of the application from the NuGet package location.
- Please refer to the below screenshot,
+ Please refer to the screenshots below:
-
+ Smart Data Extractor:
+
+
+ Smart Table Extractor:
+
Note: If you publish your application, ensure the runtimes/models folder and ONNX files are included in the publish output.
@@ -92,7 +96,5 @@ documentation: UG
{% endhighlight %}
{% endtabs %}
-
-
-
-
+
+
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/Features.md b/Document-Processing/Data-Extraction/NET/working-with-data-extraction.md
similarity index 51%
rename from Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/Features.md
rename to Document-Processing/Data-Extraction/NET/working-with-data-extraction.md
index 5ac2cc3755..ccf5c36ed3 100644
--- a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/Features.md
+++ b/Document-Processing/Data-Extraction/NET/working-with-data-extraction.md
@@ -1,181 +1,183 @@
---
-title: Features of Smart data Extractor | Syncfusion
-description: Discover the key features of Syncfusion Smart Data Extractor, a .NET library for extracting tables, forms, text, and images.
+title: Working with Data Extraction | Syncfusion
+description: Syncfusion® Smart Data Extractor is a .NET library that extracts text, tables, forms, and images from PDF and image files with structured outputs.
platform: document-processing
control: SmartDataExtractor
documentation: UG
keywords: Assemblies
+
---
-# Smart Data Extractor Features
+# Working with Data Extraction
-## Extract Data from a PDF Document
+## Extract Data as JSON from PDF or Image
-To extract structured data such as text, form fields, tables and images from an entire PDF document using the **ExtractDataAsPdfDocument** method of the **DataExtractor** class, refer to the following code example:
+The **Smart Data Extractor** enables you to process PDF documents or scanned images and export the structured content as JSON.
+This section covers two scenarios:
+- Extracting data as JSON from a **PDF document**.
+- Extracting data as JSON from an **image**.
-{% tabs %}
+### Extract Data as JSON from PDF
-{% highlight c# tabtitle="C# [Cross-platform]" %}
+To extract structured data from a PDF document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
-using System.IO;
-using Syncfusion.Pdf.Parsing;
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" playgroundButtonLink="https://raw.githubusercontent.com/SyncfusionExamples/PDF-Examples/refs/heads/master/Data-Extraction/Smart-Data-Extractor/Extract-data-as-JSON-from-PDF/.NET/Extract-data-as-JSON-from-PDF-document/Program.cs" %}
+
+using System.Text;
using Syncfusion.SmartDataExtractor;
//Open the input PDF file as a stream.
-using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read, FileShare.Read))
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
- //Extract data and return as a loaded PDF document.
- PdfLoadedDocument document = extractor.ExtractDataAsPdfDocument(inputStream);
- //Save the extracted output as a new PDF file.
- document.Save("Output.pdf");
- //Close the document to release resources.
- document.Close(true);
+ //Extract data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
}
{% endhighlight %}
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
-using Syncfusion.Pdf.Parsing;
+using System.Text;
using Syncfusion.SmartDataExtractor;
//Open the input PDF file as a stream.
-using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read, FileShare.Read))
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
- DataExtractor extractor = new DataExtractor();
- //Extract data and return as a loaded PDF document.
- PdfLoadedDocument document = extractor.ExtractDataAsPdfDocument(inputStream);
- //Save the extracted output as a new PDF file.
- document.Save("Output.pdf");
- //Close the document to release resources.
- document.Close(true);
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as JSON.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
}
-
+
{% endhighlight %}
-{% endtabs %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Extract-data-as-JSON-from-PDF/.NET).
-## Extract Data as Stream
+### Extract Data as JSON from an Image
-To extract structured data from a PDF document and return the output as a stream using the **ExtractDataAsPdfStream** method of the **DataExtractor** class, refer to the following example.
+To extract structured data from an image document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code examples.
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
+using System.Text;
using Syncfusion.SmartDataExtractor;
-//Open the input PDF file as a stream.
-using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read, FileShare.Read))
+//Open the input image file as a stream.
+using (FileStream stream = new FileStream("Image.png", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
- //Extract data and return as a PDF stream.
- Stream pdfStream = extractor.ExtractDataAsPdfStream(inputStream);
-
- //Save the extracted PDF stream into an output file.
- using (FileStream outputStream = new FileStream("Output.pdf", FileMode.Create, FileAccess.Write))
- {
- pdfStream.CopyTo(outputStream);
- }
+ //Extract data as JSON from the image stream.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
}
-
+
{% endhighlight %}
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using Syncfusion.SmartDataExtractor;
+using System.Text;
-//Open the input PDF file as a stream.
-using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read, FileShare.Read))
+//Open the input image file as a stream.
+using (FileStream stream = new FileStream("Image.png", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
- //Extract data and return as a PDF stream.
- Stream pdfStream = extractor.ExtractDataAsPdfStream(inputStream);
-
- //Save the extracted PDF stream into an output file.
- using (FileStream outputStream = new FileStream("Output.pdf", FileMode.Create, FileAccess.Write))
- {
- pdfStream.CopyTo(outputStream);
- }
+ //Extract data as JSON from the image stream.
+ string data = extractor.ExtractDataAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
}
-
+
{% endhighlight %}
-{% endtabs %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Extract-data-as-JSON-from-an-Image/.NET).
+
+## Extract Data as Markdown from PDF or Image
-## Extract Data as JSON from PDF Document
+The **Smart Data Extractor** enables you to process PDF documents or scanned images and export the structured content as Markdown (MD).
-To extract form fields across a PDF document using the **ExtractDataAsJson** method of the **DataExtractor** class, refer to the following code example:
+This section covers two scenarios:
+- Extracting data as Markdown from a PDF document.
+- Extracting data as Markdown from an image.
+
+### Extract Data as Markdown from PDF
+
+To extract structured data from a PDF document using the [ExtractDataAsMarkdown](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsMarkdown_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
-using Syncfusion.SmartDataExtractor;
-using Syncfusion.SmartFormRecognizer;
using System.Text;
+using Syncfusion.SmartDataExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
- //Extract data as JSON.
- string data = extractor.ExtractDataAsJson(stream);
- //Save the extracted JSON data into an output file.
- File.WriteAllText("Output.json", data, Encoding.UTF8);
+ //Extract data as Markdown.
+ string data = extractor.ExtractDataAsMarkdown(stream);
+ //Save the extracted Markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
}
{% endhighlight %}
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using Syncfusion.SmartDataExtractor;
-using Syncfusion.SmartFormRecognizer;
using System.Text;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
- DataExtractor extractor = new DataExtractor();
- //Extract data as JSON.
- string data = extractor.ExtractDataAsJson(stream);
- //Save the extracted JSON data into an output file.
- File.WriteAllText("Output.json", data, Encoding.UTF8);
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data as Markdown.
+ string data = extractor.ExtractDataAsMarkdown(stream);
+ //Save the extracted Markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
}
{% endhighlight %}
{% endtabs %}
-## Extract Data as Markdown from PDF Document
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Extract-data-as-MD-from-PDF/.NET).
-To extract form fields across a PDF document using the **ExtractDataAsMarkdown** method of the **DataExtractor** class, refer to the following code example:
+### Extract Data as Markdown from Image
+
+To extract structured data from an image file using the [ExtractDataAsMarkdown](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsMarkdown_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
-using Syncfusion.SmartDataExtractor;
-using Syncfusion.SmartFormRecognizer;
using System.Text;
+using Syncfusion.SmartDataExtractor;
-//Open the input PDF file as a stream.
-using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+//Open the input image file as a stream.
+using (FileStream stream = new FileStream("Input.png", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
//Extract data as Markdown.
string data = extractor.ExtractDataAsMarkdown(stream);
@@ -187,100 +189,161 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
-using Syncfusion.SmartDataExtractor;
-using Syncfusion.SmartFormRecognizer;
using System.Text;
+using Syncfusion.SmartDataExtractor;
-//Open the input PDF file as a stream.
-using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+//Open the input image file as a stream.
+using (FileStream stream = new FileStream("Input.png", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
//Extract data as Markdown.
string data = extractor.ExtractDataAsMarkdown(stream);
//Save the extracted Markdown data into an output file.
File.WriteAllText("Output.md", data, Encoding.UTF8);
}
+
{% endhighlight %}
{% endtabs %}
-## Extract Data as JSON from an Image
-To extract structured data from an image document using the **ExtractDataAsJson** method of the **DataExtractor** class, refer to the following code examples.
+## Extract Data from PDF or Image and Save as Digital PDF
-{% tabs %}
+The **Smart Data Extractor** allows you to process PDF documents or scanned images and generate a digital PDF output.
+
+In this section, you will learn how to:
+- Extract structured content and save it directly as a **PDF document**.
+- Work with the extracted content as a **PDF stream** for flexible storage or further processing.
+
+
+### Extract Data from PDF Document
+
+To extract structured data such as text, form fields, tables and images from an entire PDF document using the [ExtractDataAsPdfDocument](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsPdfDocument_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
+
+{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
+using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
-using System.Text;
-//Open the input image file as a stream.
-using (FileStream stream = new FileStream("Image.png", FileMode.Open, FileAccess.Read))
+//Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read, FileShare.Read))
{
//Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
- //Extract data as JSON from the image stream.
- string data = extractor.ExtractDataAsJson(stream);
- //Save the extracted JSON data into an output file.
- File.WriteAllText("Output.json", data, Encoding.UTF8);
+ //Extract data and return as a loaded PDF document.
+ PdfLoadedDocument document = extractor.ExtractDataAsPdfDocument(inputStream);
+ //Save the extracted output as a new PDF file.
+ document.Save("Output.pdf");
+ //Close the document.
+ document.Close(true);
}
{% endhighlight %}
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
+using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
-using System.Text;
-//Open the input image file as a stream.
-using (FileStream stream = new FileStream("Image.png", FileMode.Open, FileAccess.Read))
+//Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read, FileShare.Read))
{
//Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
- //Extract data as JSON from the image stream.
- string data = extractor.ExtractDataAsJson(stream);
- //Save the extracted JSON data into an output file.
- File.WriteAllText("Output.json", data, Encoding.UTF8);
+ //Extract data and return as a loaded PDF document.
+ PdfLoadedDocument document = extractor.ExtractDataAsPdfDocument(inputStream);
+ //Save the extracted output as a new PDF file.
+ document.Save("Output.pdf");
+ //Close the document.
+ document.Close(true);
}
{% endhighlight %}
-{% endtabs %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Extract-data-from-PDF-document/.NET).
-## Form Detection
+### Extract Data as Stream
-To extract form fields across a PDF document and save them as a PDF output using the **ExtractDataAsPdfDocument** method of the **DataExtractor** class with form recognition options, refer to the following code example:
+To extract structured data from a PDF document and return the output as a stream using the [ExtractDataAsPdfStream](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsPdfStream_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following example.
+
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" %}
+
+using Syncfusion.SmartDataExtractor;
+
+//Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read, FileShare.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data and return as a PDF stream.
+ Stream pdfStream = extractor.ExtractDataAsPdfStream(inputStream);
+ //Save the extracted PDF stream into an output file.
+ using (FileStream outputStream = new FileStream("Output.pdf", FileMode.Create, FileAccess.Write))
+ {
+ pdfStream.CopyTo(outputStream);
+ }
+}
+
+{% endhighlight %}
+
+{% highlight c# tabtitle="C# [Windows-specific]" %}
+
+using Syncfusion.SmartDataExtractor;
+
+//Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read, FileShare.Read))
+{
+ //Initialize the Data Extractor.
+ DataExtractor extractor = new DataExtractor();
+ //Extract data and return as a PDF stream.
+ Stream pdfStream = extractor.ExtractDataAsPdfStream(inputStream);
+
+ //Save the extracted PDF stream into an output file.
+ using (FileStream outputStream = new FileStream("Output.pdf", FileMode.Create, FileAccess.Write))
+ {
+ pdfStream.CopyTo(outputStream);
+ }
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Extract-data-as-stream/.NET).
+
+
+## Disable Form Detection
+
+To disable form field detection while extracting structured data from a PDF document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
-using Syncfusion.SmartFormRecognizer;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
- //Enable form detection in the document to identify form fields.
+ //Disable form detection in the document to identify form fields.
//By default - true
extractor.EnableFormDetection = false;
- //Extract form data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- //Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- //Close the document to release resources.
+ //Extract form data and return as a loaded json file.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ //Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ //Close the document.
pdf.Close(true);
}
@@ -289,26 +352,22 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
-using Syncfusion.SmartFormRecognizer;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
- //Enable form detection in the document to identify form fields.
+ //Disable form detection in the document to identify form fields.
//By default - true
extractor.EnableFormDetection = false;
- //Extract form data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- //Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- //Close the document to release resources.
+ //Extract form data and return as a loaded json file.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ //Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ //Close the document.
pdf.Close(true);
}
@@ -316,34 +375,32 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
-## Table Detection
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Disable-Form-detection/.NET).
+
+## Disable Table detection
-To extract tables across a PDF document and save them as a PDF output using the **ExtractDataAsPdfDocument** method of the **DataExtractor** class with table extraction options, refer to the following code example:
+To disable table detection while extracting structured data from a PDF document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
-using Syncfusion.SmartTableExtractor;
// Load the input PDF file.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Data Extractor.
+ // Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
- // Enable table detection and set confidence threshold.
+ // Disable table detection.
//By default - true
extractor.EnableTableDetection = false;
- // Extract data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- // Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- // Close the document to release resources.
+ // Extract data and return as a loaded json document.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ // Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ // Close the document.
pdf.Close(true);
}
@@ -352,26 +409,22 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
-using Syncfusion.SmartTableExtractor;
// Load the input PDF file.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Data Extractor.
+ // Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
- // Enable table detection and set confidence threshold.
+ // Disable table detection.
//By default - true
extractor.EnableTableDetection = false;
- // Extract data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- // Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- // Close the document to release resources.
+ // Extract data and return as a loaded json file.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ // Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ // Close the document.
pdf.Close(true);
}
@@ -379,15 +432,16 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
-## Extract Data with different Form Recognizer options
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Disable-Table-detection/.NET).
+
+## Extract Data with Form Recognizer options
-To extract structured data from a PDF document using different Form Recognizer options with the **ExtractDataAsPdfDocument** method of the **DataExtractor** class, refer to the following code example:
+To extract structured data from a PDF document using different Form Recognizer options with the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
using Syncfusion.SmartFormRecognizer;
@@ -395,12 +449,10 @@ using Syncfusion.SmartFormRecognizer;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
//Enable form detection in the document to identify form fields.
extractor.EnableFormDetection = true;
-
//Configure form recognition options for advanced detection.
FormRecognizeOptions formOptions = new FormRecognizeOptions();
//Recognize forms across pages 1 to 5 in the document.
@@ -417,13 +469,11 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
formOptions.DetectRadioButtons = true;
//Assign the configured form recognition options to the extractor.
extractor.FormRecognizeOptions = formOptions;
-
- //Extract form data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- //Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- //Close the document to release resources.
+ //Extract form data and return as a loaded json file.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ //Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ //Close the document.
pdf.Close(true);
}
@@ -431,7 +481,6 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
using Syncfusion.SmartFormRecognizer;
@@ -439,12 +488,10 @@ using Syncfusion.SmartFormRecognizer;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
//Enable form detection in the document to identify form fields.
extractor.EnableFormDetection = true;
-
//Configure form recognition options for advanced detection.
FormRecognizeOptions formOptions = new FormRecognizeOptions();
//Recognize forms across pages 1 to 5 in the document.
@@ -461,13 +508,11 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
formOptions.DetectRadioButtons = true;
//Assign the configured form recognition options to the extractor.
extractor.FormRecognizeOptions = formOptions;
-
- //Extract form data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- //Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- //Close the document to release resources.
+ //Extract form data and return as a loaded json document.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ //Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ //Close the document.
pdf.Close(true);
}
@@ -475,14 +520,16 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
-## Extract Data with different Table Extraction options
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Different-form-recognizer-options/.NET).
+
+## Extract Data with Table Extraction options
-To extract structured table data from a PDF document using advanced Table Extraction options with the **ExtractDataAsPdfDocument** method of the **DataExtractor** class, refer to the following code example:
+To extract structured table data from a PDF document using advanced Table Extraction options with the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
+
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
using Syncfusion.SmartTableExtractor;
@@ -490,12 +537,10 @@ using Syncfusion.SmartTableExtractor;
// Load the input PDF file.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Data Extractor.
+ // Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
// Enable table detection and set confidence threshold.
extractor.EnableTableDetection = true;
-
// Configure table extraction options.
TableExtractionOptions tableOptions = new TableExtractionOptions();
// Extract tables across pages 1 to 5.
@@ -506,12 +551,11 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
tableOptions.DetectBorderlessTables = true;
// Assign the table extraction options to the extractor.
extractor.TableExtractionOptions = tableOptions;
- // Extract data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- // Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- // Close the document to release resources.
+ // Extract data and return as a loaded json file.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ // Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ // Close the document.
pdf.Close(true);
}
@@ -519,7 +563,6 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
using Syncfusion.SmartTableExtractor;
@@ -527,12 +570,10 @@ using Syncfusion.SmartTableExtractor;
// Load the input PDF file.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Data Extractor.
+ // Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
// Enable table detection and set confidence threshold.
extractor.EnableTableDetection = true;
-
// Configure table extraction options.
TableExtractionOptions tableOptions = new TableExtractionOptions();
// Extract tables across pages 1 to 5.
@@ -543,12 +584,11 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
tableOptions.DetectBorderlessTables = true;
// Assign the table extraction options to the extractor.
extractor.TableExtractionOptions = tableOptions;
- // Extract data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- // Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- // Close the document to release resources.
+ // Extract data and return as a loaded json document.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ // Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ // Close the document.
pdf.Close(true);
}
@@ -556,34 +596,33 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
-## Apply Confidence Threshold to Extract the Data
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Different-table-extraction-options/.NET).
+
+## Apply Confidence Threshold for Data Extraction
-To apply confidence thresholding when extracting data from a PDF document using the **ExtractDataAsPdfDocument** method of the **DataExtractor** class, refer to the following code example:
+To apply confidence thresholding when extracting data from a PDF document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
// Load the input PDF file.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Data Extractor.
+ // Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
// Apply confidence threshold to extract the data.
// Only elements with confidence >= 0.75 will be included in the results.
//default confidence threshold value is 0.6
extractor.ConfidenceThreshold = 0.75;
- // Extract data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- // Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- // Close the document to release resources.
+ // Extract data and return as a loaded json document.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ // Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ // Close the document.
pdf.Close(true);
}
@@ -591,26 +630,23 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
// Load the input PDF file.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Data Extractor.
+ // Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
// Apply confidence threshold to extract the data.
// Only elements with confidence >= 0.75 will be included in the results.
//default confidence threshold value is 0.6
extractor.ConfidenceThreshold = 0.75;
- // Extract data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- // Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- // Close the document to release resources.
+ // Extract data and return as a loaded json file.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ // Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ // Close the document.
pdf.Close(true);
}
@@ -618,32 +654,31 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
-## Extract Data Within a Specific Page Range
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Apply-Confidence-threshold/.NET).
-To extract data from a specific range of pages in a PDF document using the ExtractDataAsPdfDocument method of the DataExtractor class, refer to the following code example:
+## Extract Data within a Specific Page Range
+
+To extract data from a specific range of pages in a PDF document using the [ExtractDataAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html#Syncfusion_SmartDataExtractor_DataExtractor_ExtractDataAsJson_System_IO_Stream_) method of the [DataExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartDataExtractor.DataExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
//Set the page range for extraction (pages 1 to 3).
extractor.PageRange = new int[,] { { 1, 3 } };
- //Extract data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
+ //Extract data and return as a loaded json document.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
//Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- //Close the document to release resources.
+ pdf.Save("Output.json");
+ //Close the document.
pdf.Close(true);
}
@@ -651,24 +686,21 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using Syncfusion.Pdf.Parsing;
using Syncfusion.SmartDataExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Data Extractor.
+ //Initialize the Data Extractor.
DataExtractor extractor = new DataExtractor();
-
//Set the page range for extraction (pages 1 to 3).
extractor.PageRange = new int[,] { { 1, 3 } };
- //Extract data and return as a loaded PDF document.
- PdfLoadedDocument pdf = extractor.ExtractDataAsPdfDocument(stream);
-
- //Save the extracted output as a new PDF file.
- pdf.Save("Output.pdf");
- //Close the document to release resources.
+ //Extract data and return as a loaded json document.
+ PdfLoadedDocument pdf = extractor.ExtractDataAsJson(stream);
+ //Save the extracted output as a new json file.
+ pdf.Save("Output.json");
+ //Close the document.
pdf.Close(true);
}
@@ -676,3 +708,8 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Data-Extractor/Extract-data-within-specific-range/.NET).
+
+
+
+
diff --git a/Document-Processing/Data-Extraction/NET/working-with-form-recognition.md b/Document-Processing/Data-Extraction/NET/working-with-form-recognition.md
new file mode 100644
index 0000000000..4e76f53b22
--- /dev/null
+++ b/Document-Processing/Data-Extraction/NET/working-with-form-recognition.md
@@ -0,0 +1,303 @@
+---
+title: Working with Form Recognition | Syncfusion®
+description: Learn how to recognize forms and configure FormRecognizeOptions in Syncfusion® SmartFormRecognizer to detect and process form elements easily.
+platform: document-processing
+control: SmartFormRecognizer
+documentation: UG
+---
+
+# Working with Form Recognition
+
+## Recognize Forms as JSON
+
+To recognize form data from a PDF or image and get the output as a JSON string using the [RecognizeFormAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_RecognizeFormAsJson_System_IO_Stream_) (synchronous) and [RecognizeFormAsJsonAsync](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_RecognizeFormAsJsonAsync_System_IO_Stream_System_Threading_CancellationToken_) (asynchronous) methods of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code examples.
+
+Example (synchronous):
+
+{% tabs %}
+{% highlight c# tabtitle="C#" playgroundButtonLink="https://raw.githubusercontent.com/SyncfusionExamples/PDF-Examples/refs/heads/master/Data-Extraction/Smart-Form-Recognizer/Recognize-forms-using-JSON/.NET/Recognize-forms-using-JSON/Program.cs" %}
+
+
+using Syncfusion.SmartFormRecognizer;
+
+// Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite))
+{
+ // Initialize the Form Recognizer.
+ FormRecognizer recognizer = new FormRecognizer();
+ // Recognize the form and get the output as a JSON string.
+ string outputJson = recognizer.RecognizeFormAsJson(inputStream);
+ // Save the extracted JSON data into an output file.
+ File.WriteAllText("result.json", outputJson);
+}
+
+{% endhighlight %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Form-Recognizer/Recognize-forms-using-JSON/.NET).
+
+Example (asynchronous):
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+using Syncfusion.SmartFormRecognizer;
+
+// Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite))
+{
+ // Initialize the Form Recognizer.
+ FormRecognizer recognizer = new FormRecognizer();
+ // Recognize the form asynchronously and get the output as a JSON string.
+ string outputJson = await recognizer.RecognizeFormAsJsonAsync(inputStream);
+ // Save the extracted JSON data into an output file.
+ File.WriteAllText("result.json", outputJson);
+}
+
+{% endhighlight %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Form-Recognizer/Recognize-forms-using-JSON-async/.NET).
+
+
+## Recognize Forms from PDF or Image
+
+### Recognize forms as PDF
+
+To recognize form data from a PDF or image and get the output as a `PdfLoadedDocument` using the [RecognizeFormAsPdfDocument](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_RecognizeFormAsPdfDocument_System_IO_Stream_) (synchronous) and [RecognizeFormAsPdfDocumentAsync](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_RecognizeFormAsPdfDocumentAsync_System_IO_Stream_System_Threading_CancellationToken_) (asynchronous) methods of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code examples.
+
+Example (synchronous):
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+using Syncfusion.SmartFormRecognizer;
+using Syncfusion.Pdf.Parsing;
+
+// Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite))
+{
+ // Initialize the Form Recognizer.
+ FormRecognizer recognizer = new FormRecognizer();
+ // Recognize the form and get the output as a PDF document.
+ PdfLoadedDocument document = recognizer.RecognizeFormAsPdfDocument(inputStream);
+ // Save the recognized document.
+ document.Save("Output.pdf");
+ // Close the document.
+ document.Close();
+}
+
+{% endhighlight %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Form-Recognizer/Recognize-forms-using-Pdf/.NET).
+
+Example (asynchronous):
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+using Syncfusion.Pdf.Parsing;
+using Syncfusion.SmartFormRecognizer;
+
+// Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite))
+{
+ // Initialize the Form Recognizer.
+ FormRecognizer recognizer = new FormRecognizer();
+ // Recognize the form asynchronously and get the output as a PDF document.
+ PdfLoadedDocument document = await recognizer.RecognizeFormAsPdfDocumentAsync(inputStream);
+ // Save the recognized document.
+ document.Save("Output.pdf");
+ // Close the document.
+ document.Close();
+}
+
+
+{% endhighlight %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Form-Recognizer/Recognize-forms-using-Pdf-async/.NET).
+
+### Recognize forms as Stream
+
+To recognize form data from a PDF or image and get the output as a `Stream` using the [RecognizeFormAsPdfStream](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_RecognizeFormAsPdfStream_System_IO_Stream_) (synchronous) and [RecognizeFormAsPdfStreamAsync](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_RecognizeFormAsPdfStreamAsync_System_IO_Stream_System_Threading_CancellationToken_) (asynchronous) methods of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code examples.
+
+Example (synchronous):
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+using Syncfusion.SmartFormRecognizer;
+
+// Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite))
+{
+ // Initialize the Form Recognizer.
+ FormRecognizer recognizer = new FormRecognizer();
+ // Recognize the form and get the output as a PDF stream.
+ using (Stream outputStream = recognizer.RecognizeFormAsPdfStream(inputStream))
+ {
+ // Save the output PDF stream to a file.
+ using (FileStream fileStream = File.Create("Output.pdf"))
+ {
+ outputStream.Seek(0, SeekOrigin.Begin);
+ outputStream.CopyTo(fileStream);
+ }
+ }
+}
+
+{% endhighlight %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Form-Recognizer/Recognize-forms-using-Stream/.NET).
+
+Example (asynchronous):
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+using Syncfusion.SmartFormRecognizer;
+
+// Open the input PDF file as a stream.
+using (FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite))
+{
+ // Initialize the Form Recognizer.
+ FormRecognizer recognizer = new FormRecognizer();
+ // Recognize the form asynchronously and get the output as a PDF stream.
+ using (Stream outputStream = await recognizer.RecognizeFormAsPdfStreamAsync(inputStream))
+ {
+ // Save the output PDF stream to a file.
+ using (FileStream fileStream = File.Create("Output.pdf"))
+ {
+ outputStream.Seek(0, SeekOrigin.Begin);
+ outputStream.CopyTo(fileStream);
+ }
+ }
+}
+
+
+{% endhighlight %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Form-Recognizer/Recognize-forms-using-Stream-async/.NET).
+
+
+## Async variants with CancellationToken
+
+To recognize form data asynchronously with cancellation support using the [RecognizeFormAsPdfStreamAsync](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_RecognizeFormAsPdfStreamAsync_System_IO_Stream_System_Threading_CancellationToken_) method of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code example.
+
+Example with cancellation token (PDF stream):
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+using Syncfusion.SmartFormRecognizer;
+
+// Open the input PDF file as a stream.
+using FileStream inputStream = new FileStream(Path.GetFullPath("Input.pdf"), FileMode.Open, FileAccess.Read);
+// Initialize the Form Recognizer.
+FormRecognizer recognizer = new FormRecognizer();
+// Create a cancellation token that cancels after 5 seconds.
+using CancellationTokenSource cts = new CancellationTokenSource(TimeSpan.FromSeconds(5));
+CancellationToken token = cts.Token;
+// Recognize the form asynchronously and get the output as a PDF stream.
+using Stream resultStream = await recognizer.RecognizeFormAsPdfStreamAsync(inputStream, token);
+// Save the output PDF stream to a file.
+using FileStream fileStream = File.Create(Path.GetFullPath("Output.pdf"));
+await resultStream.CopyToAsync(fileStream, token);
+
+{% endhighlight %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Form-Recognizer/Asyncvariants-with-CancellationToken/.NET).
+
+## Working with Form Recognize Options
+
+`FormRecognizeOptions` provides configurable settings that control how the SmartFormRecognizer detects elements from a document. It allows you to enable or disable the detection of specific form controls such as checkboxes, radio buttons, textboxes, and signatures—while also letting you fine-tune the recognition results using a confidence threshold.
+
+Additionally, it supports restricting processing to specific pages through an optional 1‑based inclusive PageRange. By adjusting these options, developers can optimize performance, reduce noise in results, and tailor form extraction precisely to the needs of their application.
+
+### Disable Textbox Detection
+
+To disable textbox field detection in the [FormRecognizeOptions](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_FormRecognizeOptions) of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code example.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+FormRecognizer recognizer = new FormRecognizer();
+// Disable textbox detection
+recognizer.FormRecognizeOptions.DetectTextboxes = false;
+
+{% endhighlight %}
+{% endtabs %}
+
+### Disable Checkbox Detection
+To disable checkbox detection in the [FormRecognizeOptions](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_FormRecognizeOptions) of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code example.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+FormRecognizer recognizer = new FormRecognizer();
+// Disable checkbox detection
+recognizer.FormRecognizeOptions.DetectCheckboxes = false;
+
+{% endhighlight %}
+{% endtabs %}
+
+### Disable RadioButtons Detection
+To disable radio button detection in the [FormRecognizeOptions](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_FormRecognizeOptions) of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code example.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+FormRecognizer recognizer = new FormRecognizer();
+// Disable radio button detection
+recognizer.FormRecognizeOptions.DetectRadioButtons = false;
+
+{% endhighlight %}
+{% endtabs %}
+
+### Disable Signature Detection
+To disable signature field detection in the [FormRecognizeOptions](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_FormRecognizeOptions) of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code example.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+FormRecognizer recognizer = new FormRecognizer();
+// Disable signature detection
+recognizer.FormRecognizeOptions.DetectSignatures = false;
+
+{% endhighlight %}
+{% endtabs %}
+
+### Set Confidence Threshold
+
+To set a minimum confidence score for detected form elements using the [ConfidenceThreshold](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizeOptions.html#Syncfusion_SmartFormRecognizer_FormRecognizeOptions_ConfidenceThreshold) in the [FormRecognizeOptions](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_FormRecognizeOptions) of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code example.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+FormRecognizer recognizer = new FormRecognizer();
+// Set a ConfidenceThreshold
+recognizer.FormRecognizeOptions.ConfidenceThreshold = 0.9;
+
+{% endhighlight %}
+{% endtabs %}
+
+### Set Page Range
+
+To specify which pages to process using the **PageRange** in the [FormRecognizeOptions](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html#Syncfusion_SmartFormRecognizer_FormRecognizer_FormRecognizeOptions) of the [FormRecognizer](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartFormRecognizer.FormRecognizer.html) class, refer to the following code example.
+
+{% tabs %}
+{% highlight c# tabtitle="C#" %}
+
+FormRecognizer recognizer = new FormRecognizer();
+// Set a single page range – detects only the specified page
+recognizer.FormRecognizeOptions.PageRange = new int[,] { { 3 }, { 8 } };
+
+// Set a page range – detects content between the specified start and end page
+recognizer.FormRecognizeOptions.PageRange = new int[,] { { 3, 8 } };
+
+{% endhighlight %}
+{% endtabs %}
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/Features.md b/Document-Processing/Data-Extraction/NET/working-with-table-extraction.md
similarity index 53%
rename from Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/Features.md
rename to Document-Processing/Data-Extraction/NET/working-with-table-extraction.md
index cb4699cc27..c81f8fb6c4 100644
--- a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/Features.md
+++ b/Document-Processing/Data-Extraction/NET/working-with-table-extraction.md
@@ -1,30 +1,34 @@
---
-title: Features of Smart table Extractor | Syncfusion
-description: Discover the key features of Syncfusion Smart Table Extractor, a .NET library designed to extract tables, forms, text, and images from documents.
+title: Working with Table Extraction | Syncfusion
+description: Syncfusion® Smart Table Extractor is a .NET library that extracts structured table data from PDF and image files with confidence scoring.
platform: document-processing
-control: PDF
+control: SmartTableExtractor
documentation: UG
-keywords: Assemblies
+keywords: Table Extraction, PDF, Image, Smart Table Extractor, Assemblies, NuGet Packages
---
-# Smart Table Extractor Features
+# Working with Table Extraction
-## Extract Tables from a PDF Document
+The Syncfusion® Smart Table Extractor is a .NET library used to extract structured table data from PDF and image files.
-To extract structured table data from a PDF document using the **ExtractTableAsJson** method of the **TableExtractor** class, refer to the following code
+To quickly get started with extracting table data from PDF and image files in ASP.NET Core using the Smart Table Extractor library, refer to this video tutorial:
+{% youtube "https://www.youtube.com/watch?v=q1rs_8nTG5M" %}
+
+## Extract Table Data as JSON from PDF or Image
+
+To extract structured table data from a PDF document using the [ExtractTableAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html#Syncfusion_SmartTableExtractor_TableExtractor_ExtractTableAsJson_System_IO_Stream_) method of the [TableExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html) class, refer to the following code
{% tabs %}
-{% highlight c# tabtitle="C# [Cross-platform]" %}
+{% highlight c# tabtitle="C# [Cross-platform]" playgroundButtonLink="https://raw.githubusercontent.com/SyncfusionExamples/PDF-Examples/refs/heads/master/Data-Extraction/Smart-Table-Extractor/Extract-tables-from-pdf-document/.NET/Extract-tables-from-pdf-document/Program.cs" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Table Extractor
+ // Initialize the Table Extractor
TableExtractor extractor = new TableExtractor();
//Extract table data from the PDF document as JSON string.
string data = extractor.ExtractTableAsJson(stream);
@@ -36,7 +40,6 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
@@ -55,72 +58,68 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
-## Extract Tables with Border-less Table Detection
+N> To convert an image instead of a PDF, replace the input stream with the image file (for example, *Input.jpg* or *Input.png*). The rest of the code remains unchanged.
-To extract structured table data from a PDF document that contains tables without visible borders using the **ExtractTableAsJson** method of the **TableExtractor** class, refer to the following code examples.
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Table-Extractor/Extract-tables-from-pdf-document/.NET).
-{% tabs %}
+## Extract Table Data as Markdown from PDF or Image
+
+To extract structured table data from a PDF document using the [ExtractTableAsMarkdown](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html#Syncfusion_SmartTableExtractor_TableExtractor_ExtractTableAsMarkdown_System_IO_Stream_) method of the [TableExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html) class, refer to the following code
+
+{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Table Extractor.
+ // Initialize the Table Extractor
TableExtractor extractor = new TableExtractor();
- //Configure the table extraction option to detect border-less tables in the document.
- TableExtractionOptions options = new TableExtractionOptions();
- options.DetectBorderlessTables = true;
-
- //Assign the configured options to the extractor.
- extractor.TableExtractionOptions = options;
- //Extract table data from the PDF document as a JSON string.
- string data = extractor.ExtractTableAsJson(stream);
- //Save the extracted JSON data into an output file.
- File.WriteAllText("Output.json", data, Encoding.UTF8);
+ //Extract table data from the PDF document as markdown.
+ string data = extractor.ExtractTableAsMarkdown(stream);
+ //Save the extracted markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
}
{% endhighlight %}
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Table Extractor.
+ // Initialize the Table Extractor
TableExtractor extractor = new TableExtractor();
- //Configure the table extraction option to detect border-less tables in the document.
- TableExtractionOptions options = new TableExtractionOptions();
- options.DetectBorderlessTables = true;
- //Assign the configured options to the extractor.
- extractor.TableExtractionOptions = options;
- //Extract table data from the PDF document as a JSON string.
- string data = extractor.ExtractTableAsJson(stream);
- //Save the extracted JSON data into an output file.
- File.WriteAllText("Output.json", data, Encoding.UTF8);
+ //Extract table data from the PDF document as markdown.
+ string data = extractor.ExtractTableAsMarkdown(stream);
+ //Save the extracted markdown data into an output file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
}
{% endhighlight %}
-{% endtabs %}
+{% endtabs %}
-## Extract Tables within a Specific Page Range
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Table-Extractor/Extract-table-data-as-MD-from-PDF/.NET).
-To extract structured table data from a specific range of pages in a PDF document using the **ExtractTableAsJson** method of the **TableExtractor** class, refer to the following code example:
+N> To convert an image instead of a PDF, replace the input stream with the image file (for example, *Input.jpg* or *Input.png*). The rest of the code remains unchanged.
+
+## Extract Table Data within a Specific Page Range
+
+### Extract as JSON
+
+To extract structured table data from a specific range of pages in a PDF document using the [ExtractTableAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html#Syncfusion_SmartTableExtractor_TableExtractor_ExtractTableAsJson_System_IO_Stream_) method of the [TableExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
@@ -144,7 +143,6 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
@@ -168,15 +166,16 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
-## Apply a Confidence Threshold to Extract Table Data
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Table-Extractor/Extract-tables-by-page-range/.NET).
-To apply confidence thresholding when extracting table data from a PDF document using the **ExtractTableAsJson** method of the **TableExtractor** class, refer to the following code example:
+### Extract as Markdown
+
+To extract structured table data from a specific range of pages in a PDF document or Image using the [ExtractTableAsMarkdown](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html#Syncfusion_SmartTableExtractor_TableExtractor_ExtractTableAsMarkdown_System_IO_Stream_) method of the [TableExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
@@ -185,56 +184,51 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{
//Initialize the Smart Table Extractor.
TableExtractor extractor = new TableExtractor();
- //Configure table extraction options to set the confidence threshold for detection.
+ //Set the page range for extraction (pages 1 to 3).
TableExtractionOptions options = new TableExtractionOptions();
- options.ConfidenceThreshold = 0.6;
- //Assign the configured options to the extractor.
+ options.PageRange = new int[,] { { 1, 3 } };
extractor.TableExtractionOptions = options;
- //Extract table data from the PDF document as a JSON string.
- string data = extractor.ExtractTableAsJson(stream);
- //Save the extracted JSON data into an output file.
- File.WriteAllText("Output.json", data, Encoding.UTF8);
+ //Extract table data from the specified page range as a Markdown string.
+ string data = extractor.ExtractTableAsMarkdown(stream);
+ //Save the extracted output as a new Markdown file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
}
-
+
{% endhighlight %}
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- //Initialize the Smart Table Extractor.
- TableExtractor extractor = new TableExtractor();
- //Configure table extraction options to set the confidence threshold for detection.
- TableExtractionOptions options = new TableExtractionOptions();
- options.ConfidenceThreshold = 0.6;
- //Assign the configured options to the extractor.
- extractor.TableExtractionOptions = options;
- //Extract table data from the PDF document as a JSON string.
- string data = extractor.ExtractTableAsJson(stream);
- //Save the extracted JSON data into an output file.
- File.WriteAllText("Output.json", data, Encoding.UTF8);
+ //Initialize the Smart Table Extractor.
+ TableExtractor extractor = new TableExtractor();
+ //Set the page range for extraction (pages 1 to 3).
+ TableExtractionOptions options = new TableExtractionOptions();
+ options.PageRange = new int[,] { { 1, 3 } };
+ extractor.TableExtractionOptions = options;
+ //Extract table data from the specified page range as a Markdown string.
+ string data = extractor.ExtractTableAsMarkdown(stream);
+ //Save the extracted output as a new Markdown file.
+ File.WriteAllText("Output.md", data, Encoding.UTF8);
}
{% endhighlight %}
-{% endtabs %}
+{% endtabs %}
-## Extract Table Data Asynchronously from a PDF Document
+## Extract Table Data Asynchronously from PDF or Image
-To extract table data asynchronously with cancellation support using the **ExtractTableAsJsonAsync** method of the **TableExtractor** class, refer to the following code example:
+To extract table data asynchronously with cancellation support using the [ExtractTableAsJsonAsync](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html#Syncfusion_SmartTableExtractor_TableExtractor_ExtractTableAsJsonAsync_System_IO_Stream_System_Threading_CancellationToken_) method of the [TableExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html) class, refer to the following code example:
{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using System.Text;
-using System.Threading;
using Syncfusion.SmartTableExtractor;
//Open the input PDF file as a stream.
@@ -254,9 +248,7 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using System.Text;
-using System.Threading;
using Syncfusion.SmartTableExtractor;
//Open the input PDF file as a stream.
@@ -276,49 +268,143 @@ using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess
{% endtabs %}
-## Extract Table data as Markdown from a PDF Document
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Table-Extractor/Extract-table-data-async/.NET).
-To extract structured table data from a PDF document using the **ExtractTableAsMarkdown** method of the **TableExtractor** class, refer to the following code
-{% tabs %}
+## Table Extraction Options
+
+### Disable Border-less Table Detection
+
+To disable detection of tables without visible borders in a PDF document or Image using the [ExtractTableAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html#Syncfusion_SmartTableExtractor_TableExtractor_ExtractTableAsJson_System_IO_Stream_) method of the [TableExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html) class, refer to the following code examples.
+
+{% tabs %}
{% highlight c# tabtitle="C# [Cross-platform]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Table Extractor
+ //Initialize the Smart Table Extractor.
TableExtractor extractor = new TableExtractor();
- //Extract table data from the PDF document as markdown.
- string data = extractor.ExtractTableAsMarkdown(stream);
- //Save the extracted markdown data into an output file.
- File.WriteAllText("Output.md", data, Encoding.UTF8);
+ //Configure the table extraction option to disable border-less tables in the document.
+ TableExtractionOptions options = new TableExtractionOptions();
+ //By default *DetectBorderlessTables is true*
+ options.DetectBorderlessTables = false;
+ //Assign the configured options to the extractor.
+ extractor.TableExtractionOptions = options;
+ //Extract table data from the PDF document as a JSON string.
+ string data = extractor.ExtractTableAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
}
{% endhighlight %}
{% highlight c# tabtitle="C# [Windows-specific]" %}
-using System.IO;
using System.Text;
using Syncfusion.SmartTableExtractor;
//Open the input PDF file as a stream.
using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
{
- // Initialize the Smart Table Extractor
+ //Initialize the Smart Table Extractor.
TableExtractor extractor = new TableExtractor();
- //Extract table data from the PDF document as markdown.
- string data = extractor.ExtractTableAsMarkdown(stream);
- //Save the extracted markdown data into an output file.
- File.WriteAllText("Output.md", data, Encoding.UTF8);
+ //Configure the table extraction option to detect border-less tables in the document.
+ TableExtractionOptions options = new TableExtractionOptions();
+ options.DetectBorderlessTables = true;
+ //Assign the configured options to the extractor.
+ extractor.TableExtractionOptions = options;
+ //Extract table data from the PDF document as a JSON string.
+ string data = extractor.ExtractTableAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
}
{% endhighlight %}
-{% endtabs %}
+{% endtabs %}
+
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Table-Extractor/Extract-border-less-table-detection/.NET).
+
+### Apply Confidence Threshold for Table Data Extraction
+
+To apply confidence thresholding when extracting table data from a PDF document using the [ExtractTableAsJson](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html#Syncfusion_SmartTableExtractor_TableExtractor_ExtractTableAsJson_System_IO_Stream_) method of the [TableExtractor](https://help.syncfusion.com/cr/document-processing/Syncfusion.SmartTableExtractor.TableExtractor.html) class, refer to the following code example:
+
+{% tabs %}
+
+{% highlight c# tabtitle="C# [Cross-platform]" %}
+
+using System.Text;
+using Syncfusion.SmartTableExtractor;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Smart Table Extractor.
+ TableExtractor extractor = new TableExtractor();
+ //Configure table extraction options to set the confidence threshold for detection.
+ TableExtractionOptions options = new TableExtractionOptions();
+ options.ConfidenceThreshold = 0.6;
+ //Assign the configured options to the extractor.
+ extractor.TableExtractionOptions = options;
+ //Extract table data from the PDF document as a JSON string.
+ string data = extractor.ExtractTableAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% highlight c# tabtitle="C# [Windows-specific]" %}
+
+using System.Text;
+using Syncfusion.SmartTableExtractor;
+
+//Open the input PDF file as a stream.
+using (FileStream stream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read))
+{
+ //Initialize the Smart Table Extractor.
+ TableExtractor extractor = new TableExtractor();
+ //Configure table extraction options to set the confidence threshold for detection.
+ TableExtractionOptions options = new TableExtractionOptions();
+ options.ConfidenceThreshold = 0.6;
+ //Assign the configured options to the extractor.
+ extractor.TableExtractionOptions = options;
+ //Extract table data from the PDF document as a JSON string.
+ string data = extractor.ExtractTableAsJson(stream);
+ //Save the extracted JSON data into an output file.
+ File.WriteAllText("Output.json", data, Encoding.UTF8);
+}
+
+{% endhighlight %}
+
+{% endtabs %}
+You can download a complete working sample from [GitHub](https://github.com/SyncfusionExamples/PDF-Examples/tree/master/Data-Extraction/Smart-Table-Extractor/Apply-confidence-threshold/.NET).
+
+## PDF to Markdown Preservation Mapping
+
+This section illustrates how table elements in PDF documents are converted and preserved in Markdown format, ensuring that document structure and formatting remain consistent during the PDF‑to‑Markdown conversion process.
+
+
+
+
+ PDF Elements
+ Preservation in Markdown
+
+
+
+
+ Table
+ Table
+
+
+ Text Inline Styles
+ Bold and Italic
+
+
+
diff --git a/Document-Processing/Data-Extraction/OCR/NET/Assemblies-Required.md b/Document-Processing/Data-Extraction/OCR/NET/Assemblies-Required.md
deleted file mode 100644
index 8f19c56d27..0000000000
--- a/Document-Processing/Data-Extraction/OCR/NET/Assemblies-Required.md
+++ /dev/null
@@ -1,65 +0,0 @@
----
-title: Assemblies Required for OCR | Syncfusion
-description: This section describes the required Syncfusion assemblies needed to integrate and use the OCR Processor effectively in your applications
-platform: document-processing
-control: PDF
-documentation: UG
-keywords: Assemblies
----
-# Assemblies Required to work with OCR processor
-
-Get the following required assemblies by downloading the OCR library installer. Download and install the OCR library for Windows, Linux, and Mac respectively. Please refer to the advanced installation steps for more details.
-
-#### Syncfusion® assemblies
-
-
-
-
-Platform(s)
-Assemblies
-
-
-
-
-Windows Forms, WPF, ASP.NET, and ASP.NET MVC
-
-
-
-Syncfusion.OCRProcessor.Base.dll
-Syncfusion.Pdf.Base.dll
-Syncfusion.Compression.Base.dll
-Syncfusion.ImagePreProcessor.Base.dll
-
-
-
-
-
-.NET Standard 2.0
-
-
-
-Syncfusion.OCRProcessor.Portable.dll
-Syncfusion.PdfImaging.Portable.dll
-Syncfusion.Pdf.Portable.dll
-Syncfusion.Compression.Portable.dll
-{{'[SkiaSharp](https://www.nuget.org/packages/SkiaSharp/3.119.1)'| markdownify }} package
-Syncfusion.ImagePreProcessor.Portable.dll
-
-
-
-
-
-.NET 8/.NET 9/.NET 10
-
-
-
-Syncfusion.OCRProcessor.NET.dll
-Syncfusion.PdfImaging.NET.dll
-Syncfusion.Pdf.NET.dll
-Syncfusion.Compression.NET.dll
-{{'[SkiaSharp](https://www.nuget.org/packages/SkiaSharp/3.119.1)'| markdownify }} package
-Syncfusion.ImagePreProcessor.NET.dll
-
-
-
-
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/OCR/overview.md b/Document-Processing/Data-Extraction/OCR/overview.md
deleted file mode 100644
index 733184a8b0..0000000000
--- a/Document-Processing/Data-Extraction/OCR/overview.md
+++ /dev/null
@@ -1,14 +0,0 @@
----
-title: Intro to OCR Processor | Syncfusion
-description: This page introduces the Syncfusion OCR Processor, describing its purpose, key capabilities, and how to get started with optical character recognition in .NET applications.
-platform: document-processing
-control: OCRProcessor
-documentation: UG
-keywords: OCR, Optical Character Recognition, Text Recognition
----
-
-# Welcome to Syncfusion OCR Processor Library
-
-Syncfusion® OCR Processor is a high‑performance .NET library that enables accurate text recognition from scanned documents, images, and PDF files. Designed for modern .NET workflows, it processes raster images and document pages to recognize printed text, analyze page layouts, and extract textual content programmatically.
-
-The OCR Processor supports common document formats and provides a streamlined API for converting image‑based content into machine‑readable text, making it suitable for scenarios such as document digitization, text search, content indexing, and data processing in enterprise applications.
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/Assemblies-Required.md b/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/Assemblies-Required.md
deleted file mode 100644
index eb871b3030..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/Assemblies-Required.md
+++ /dev/null
@@ -1,74 +0,0 @@
----
-title: Assemblies required for SmartDataExtractor | Syncfusion
-description: This section details the Syncfusion assemblies required to configure and run Smart Data Extractor seamlessly in .NET projects.
-platform: document-processing
-control: SmartDataExtractor
-documentation: UG
-keywords: Assemblies
----
-# Assemblies Required to work with Smart Data Extractor
-
-The following assemblies need to be referenced in your application based on the platform.
-
-
-
- Platform(s)
- Assembly
-
-
-
-
-
- {{'WPF'| markdownify }},
- {{'Windows Forms'| markdownify }} and {{'ASP.NET MVC'|
- markdownify }}
-
-
- Syncfusion.Compression.Base
- Syncfusion.ImagePreProcessor.Base
- Syncfusion.OCRProcessor.Base
- Syncfusion.Pdf.Base
- Syncfusion.PdfToImageConverter.Base
- Syncfusion.SmartFormRecognizer.Base
- Syncfusion.SmartTableExtractor.Base
- Syncfusion.Markdown
-
-
-
-
- {{'.NET Core'| markdownify }}
- and {{'.NET Platforms'| markdownify }}
-
-
- Syncfusion.Compression.Portable
- Syncfusion.ImagePreProcessor.Portable
- Syncfusion.OCRProcessor.Portable
- Syncfusion.Pdf.Imaging.Portable
- Syncfusion.Pdf.Portable
- Syncfusion.PdfToImageConverter.Portable
- Syncfusion.SmartFormRecognizer.Portable
- Syncfusion.SmartTableExtractor.Portable
- Syncfusion.Markdown
-
-
-
-
- {{'.NET Multi-platform App UI (.NET MAUI)'| markdownify }}
-
-
- Syncfusion.Compression.NET
- Syncfusion.ImagePreProcessor.NET
- Syncfusion.OCRProcessor.NET
- Syncfusion.Pdf.Imaging.NET
- Syncfusion.Pdf.NET
- Syncfusion.PdfToImageConverter.NET
- Syncfusion.SmartFormRecognizer.NET
- Syncfusion.SmartTableExtractor.NET
- Syncfusion.Markdown
-
-
-
-
-
-
-
diff --git a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/NuGet-Packages-Required.md b/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/NuGet-Packages-Required.md
deleted file mode 100644
index caff6b27b2..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/NuGet-Packages-Required.md
+++ /dev/null
@@ -1,66 +0,0 @@
----
-title: NuGet Packages for SmartDataExtractor | Syncfusion®
-description: Learn the NuGet packages required to use Syncfusion® SmartDataExtractor in various platforms and frameworks.
-platform: document-processing
-control: SmartDataExtractor
-documentation: UG
-keywords: Assemblies
----
-# NuGet Packages Required for Smart Data Extractor
-
-## Extract Structured data from PDF
-
-To work with Smart Data Extractor, the following NuGet packages need to be installed in your application from [nuget.org](https://www.nuget.org/).
-
-
-
-
-Platform(s)
-NuGet Package
-
-
-
-
-Windows Forms
-Console Application (Targeting .NET Framework)
-
-
-{{'[Syncfusion.SmartDataExtractor.WinForms.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.WinForms/)'| markdownify }}
-
-
-
-
-WPF
-
-
-{{'[Syncfusion.SmartDataExtractor.Wpf.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Wpf)'| markdownify }}
-
-
-
-
-ASP.NET MVC5
-
-
-{{'[Syncfusion.SmartDataExtractor.AspNet.Mvc5.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.AspNet.Mvc5)'| markdownify }}
-
-
-
-
-ASP.NET Core (Targeting NET Core)
-Console Application (Targeting .NET Core)
-
-
-{{'[Syncfusion.SmartDataExtractor.Net.Core.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.Net.Core)'| markdownify }}
-
-
-
-
-Windows UI (WinUI)
-.NET Multi-platform App UI (.NET MAUI)
-
-
-{{'[Syncfusion.SmartDataExtractor.NET.nupkg](https://www.nuget.org/packages/Syncfusion.SmartDataExtractor.NET)'| markdownify }}
-
-
-
-
diff --git a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/overview.md b/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/overview.md
deleted file mode 100644
index 257535b0e3..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Data-Extractor/NET/overview.md
+++ /dev/null
@@ -1,250 +0,0 @@
----
-title: Overview for SmartDataExtractor | Syncfusion
-description: Learn how to detects form data from PDFs and scanned images using Syncfusion® Essential Studio® SmartDataExtractor.
-platform: document-processing
-control: SmartDataExtractor
-documentation: UG
-keywords: Assemblies
----
-
-# Overview of Smart Data Extractor
-
-Syncfusion® Smart Data Extractor is a high performance, deterministic C# library that extracts complete document structures from PDFs and images. Designed for .NET workflows, it analyzes visual layout lines, boxes, labels, and alignment to locate and extract elements such as table structure, text elements, images, headers, footers, and form fields with per field confidence scores for immediate review, export, or integration.
-
-## Key Features of Essential® Smart Data Extractor
-
-The following list shows the key features available in the Essential® SmartDataExtractor.
-
-* **Document structure extraction:** Detects text elements, images, headers/footers, and complete table structure (regions, header rows, columns, cell boundaries, merged cells).
-* **File format support:** Works with PDF and common image formats (JPEG, PNG).
-* **Table extraction:** Specialized parsing to recover table rows, columns, header detection and cell spans.
-* **Form recognition:** Detects and extracts form fields (text inputs, checkboxes, radio buttons) with field types and values.
-* **Page-level control:** Extract data from specific pages or defined page ranges.
-* **Confidence thresholding:** Results are filtered based on a configurable confidence score (0.0–1.0).
-* **Deterministic performance:** Designed for predictable, repeatable extraction across environments (Windows, Linux, Azure, Docker).
-
-## JSON Output Structure and Attributes
-
-The Smart Data Extractor library processes PDFs and scanned images to extract structured document data including tables, form fields, text elements, images, headers, and footers. The extracted output is returned as structured JSON that includes per-field confidence scores and a complete document hierarchy, making it ready for immediate review, export, or integration into downstream workflows.
-
-### Root Structure
-
-Below is the root structure of the JSON result:
-
-{% tabs %}
-
-{% highlight json tabtitle="JSON" %}
-
-{
- "Pages": [
- {
- "PageNumber": 1,
- "Width": 0,
- "Height": 0,
- "PageObjects": [],
- "FormObjects": []
- }
- ]
-}
-
-{% endhighlight %}
-
-{% endtabs %}
-
-### JSON Attributes
-
-#### Page Object
-
-The Page Object represents the metadata of a page along with all the detected elements it contains.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-PageNumber
-Integer
-Sequential number of the page in the document.
-
-
-Width
-Float
-Page width in points/pixels.
-
-
-Height
-Float
-Page height in points/pixels.
-
-
-PageObjects
-Array
-List of detected objects (table).
-
-
-FormObjects
-Array
-List of detected form fields (checkboxes, text boxes, radio button, signature etc..)
-
-
-
-
-#### PageObjects
-
-PageObjects represent detected elements on a page such as text, headers, footers, tables, images, and numbers.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-Type
-String
-Defines the kind of object detected on the page (Table).
-
-
-Bounds
-Array of Floats
-The bounding box coordinates [X, Y, Width, Height] representing the object's position and size on the page.
-
-
-Content
-String
-Extracted text or value associated with the object (if applicable).
-
-
-Confidence
-Float
-Confidence score (0–1) indicating the accuracy of detection.
-
-
-TableFormat (only for tables)
-Object
-Metadata about table detection, including detection score and label.
-
-
-Rows (only for tables)
-Array
-Collection of row objects that make up the table.
-
-
-
-
-#### Row Object
-
-The Row Object represents a single horizontal group of cells within a table, along with its bounding box.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-Type
-String
-Row type (e.g., tr).
-
-
-Rect
-Array
-Bounding box coordinates for the row.
-
-
-Cells
-Array
-Collection of cell objects contained in the row.
-
-
-
-
-#### Cell Object
-
-The Cell Object represents an individual table entry, containing text values, spanning details, and positional coordinates.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-Type
-String
-Cell type (e.g., td).
-
-
-Rect
-Array
-Bounding box coordinates for the cell.
-
-
-RowSpan / ColSpan
-Integer
-Number of rows or columns spanned by the cell.
-
-
-RowStart / ColStart
-Integer
-Starting row and column index of the cell.
-
-
-Content.Value
-String
-Text content inside the cell.
-
-
-
-
-#### FormObjects
-
-FormObjects represent interactive form fields detected on the page, such as text boxes, checkboxes, radio buttons, and signature regions. Each object includes positional data, field dimensions, field type, and a confidence score that reflects the reliability of the detection.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-X / Y
-Float
-Coordinates of the form field on the page.
-
-
-Width / Height
-Float
-Dimensions of the form field.
-
-
-Type
-Integer
-Numeric identifier for the form field type (e.g., 0 = TextArea, 1 = Checkbox, 2 = Radio Button, 3 = Signature).
-
-
-Confidence
-Float
-Confidence score (0–1) indicating detection accuracy.
-
-
-
-
diff --git a/Document-Processing/Data-Extraction/Smart-Data-Extractor/overview.md b/Document-Processing/Data-Extraction/Smart-Data-Extractor/overview.md
deleted file mode 100644
index eacadfaaa2..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Data-Extractor/overview.md
+++ /dev/null
@@ -1,14 +0,0 @@
----
-title: Intro to Smart data extractor| Syncfusion
-description: This page introduces Syncfusion Smart Data Extractor, outlining its purpose, core details, and usage in .NET applications.
-platform: document-processing
-control: SmartDataExtractor
-documentation: UG
-keywords: Assemblies
----
-
-# Welcome to Syncfusion Smart Data Extractor Library
-
-Syncfusion® Smart Data Extractor is a high performance, deterministic C# library that extracts complete document structures from PDFs and images. Designed for .NET workflows, it analyzes visual layout lines, boxes, labels, and alignment to locate and extract elements such as table structure, text elements, images, headers, footers, and form fields with per field confidence scores for immediate review, export, or integration.
-
-
diff --git a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/assemblies-required.md b/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/assemblies-required.md
deleted file mode 100644
index 8705698737..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/assemblies-required.md
+++ /dev/null
@@ -1,59 +0,0 @@
----
-title: Assemblies required for SmartFormRecognizer| Syncfusion®
-description: Learn the assemblies required to use Syncfusion® SmartFormRecognizer library in various platforms and frameworks.
-platform: document-processing
-control: SmartFormRecognizer
-documentation: UG
----
-
-# Assemblies Required to work with SmartFormRecognizer
-The following assemblies need to be referenced in your application based on the platform.
-
-
-
-
- Platform(s)
- Assembly
- Dependent Assemblies
-
-
-
-
-
- Windows Forms, WPF and ASP.NET MVC5
- Syncfusion.SmartFormRecognizer.Base
-
-
- Syncfusion.Pdf.Base
- Syncfusion.Compression.Base
- Syncfusion.PdfToImageConverter.Base
-
-
-
-
-
-
- Blazor, .NET Core and .NET Platforms
- Syncfusion.SmartFormRecognizer.Portable
-
-
- Syncfusion.Pdf.Portable
- Syncfusion.Compression.Portable
- Syncfusion.PdfToImageConverter.Portable
-
-
-
-
-
-
- Syncfusion.SmartFormRecognizer.NET
-
-
- Syncfusion.Pdf.NET
- Syncfusion.Compression.NET
- Syncfusion.PdfToImageConverter.NET
-
-
-
-
-
diff --git a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/nuGet-packages-required.md b/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/nuGet-packages-required.md
deleted file mode 100644
index 767a307bc7..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/nuGet-packages-required.md
+++ /dev/null
@@ -1,57 +0,0 @@
----
-title: NuGet Packages for SmartFormRecognizer | Syncfusion®
-description: Learn the NuGet packages required to use Syncfusion® SmartFormRecognizer in various platforms and frameworks.
-platform: document-processing
-control: SmartFormRecognizer
-documentation: UG
----
-
-# NuGet Packages Required for SmartFormRecognizer
-
-To work with SmartFormRecognizer, the following NuGet packages need to be installed in your application.
-
-
-
-
-Platform(s)
-NuGet Package
-
-
-
-
-Windows Forms
-
-
-{{'[Syncfusion.SmartFormRecognizer.WinForms.nupkg]'| markdownify }}
-
-
-
-
-WPF
-
-
-{{'[Syncfusion.SmartFormRecognizer.WPF.nupkg]'| markdownify }}
-
-
-
-Blazor, .NET Core and .NET Platforms
-
-{{'[Syncfusion.SmartFormRecognizer.NET.nupkg]'| markdownify }}
-
-
-
-
-{{'[Syncfusion.SmartFormRecognizer.Net.Core.nupkg]'| markdownify }}
-
-
-
-
-ASP.NET MVC5
-
-
-{{'[Syncfusion.SmartFormRecognizer.AspNet.MVC5.nupkg]'| markdownify }}
-
-
-
-
-N> The above mentioned NuGet packages are available in [nuget.org](https://www.nuget.org/).
diff --git a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/overview.md b/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/overview.md
deleted file mode 100644
index b241c3ed1a..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/overview.md
+++ /dev/null
@@ -1,45 +0,0 @@
----
-title: Detect Form Fields| Syncfusion®
-description: Learn how to detects form data from PDFs and scanned images using Syncfusion® Essential Studio® SmartFormRecognizer.
-platform: document-processing
-control: SmartFormRecognizer
-documentation: UG
-keywords: Assemblies
----
-
-# Overview of Smart Form Recognizer
-
-Smart Form Recognizer is a deterministic, on premise C# library for .NET designed to reliably detect form data from PDFs and scanned images. Unlike AI‑based approaches, this library uses visual layout heuristics including lines, boxes, and circular markers to identify form structures with high consistency and predictability.It supports to identify the common form controls such as text fields, checkboxes, radio buttons, and signature regions, producing clean, structured JSON that can be fed directly into review and workflow systems.
-
-
-## Core Capabilities
-
-* **Form layout detection**: Locate form regions using graphical heuristics (lines, boxes, circles) for consistent field discovery.
-
-* **Fillable PDF export**: Create a PDF with detected form fields added so documents are immediately usable in form workflows.
-
-* **Page-level control**: Process specific pages or page ranges for targeted extraction.
-
-* **Multi-format support**: Works with PDF, JPEG, PNG and other common image formats.
-
-* **Confidence filtering**: Per-field confidence scores with configurable thresholds to control output quality and drive review logic.
-
-* **Ready for .NET integration**: Deterministic, on premise library that outputs JSON and integrates into existing .NET pipelines and review UIs.
-
-The following code snippet illustrates how to detects form data from PDFs using FormRecognizer method in SmartFormRecognizer.
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-//Initialize the Form Recognizer
-FormRecognizer smartFormRecognizer = new FormRecognizer();
-//Read the input PDF file as stream
-FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite);
-//Recognize the form and get the output as PDF stream
-PdfLoadedDocument pdfLoadedDocument = smartFormRecognizer.RecognizeFormAsPdfDocument(inputStream);
-//Save the loadeddocument
-pdfLoadedDocument.Save(Output.pdf);
-
-{% endhighlight %}
-{% endtabs %}
-
diff --git a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/recognize-forms.md b/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/recognize-forms.md
deleted file mode 100644
index 08376519a7..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/recognize-forms.md
+++ /dev/null
@@ -1,176 +0,0 @@
----
-title: Working with Recognize methods in SmartFormRecognizer| Syncfusion®
-description: Learn how to effectively use the Recognize methods in the Syncfusion® SmartFormRecognizer library to process and detects from forms with ease.
-platform: document-processing
-control: SmartFormRecognizer
-documentation: UG
----
-
-# Recognize Forms Using SmartFormRecognizer
-
-The `FormRecognizer` exposes several convenience methods to recognize forms from a `Stream` input. Each method accepts an input `Stream` (PDF or image) and returns recognized output either as a `PdfLoadedDocument`, a `Stream` containing PDF data, or as a JSON string.
-
-Below each method signature you'll find a explanation and corrected example usage (both synchronous and asynchronous where applicable).
-
-## Recognize forms using PdfLoadedDocument
-Using `PdfLoadedDocument` this operation analyzes the form content supplied through the inputStream whether it contains a PDF or an image and produces a fully enriched PdfLoadedDocument that includes recognized form elements such as checkboxes, radio buttons, textboxes, and signatures, according to the options defined in `FormRecognizeOptions`. This recognition process supports both execution patterns: the synchronous `RecognizeFormAsPdfDocument` method for immediate, blocking processing, and the asynchronous `RecognizeFormAsPdfDocumentAsync` method for non‑blocking, await processing ideal for responsive UI applications or scalable server side workflows.
-
-Example (synchronous):
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-public void Button_Click(object sender, RoutedEventArgs e)
- {
- //Initialize the Form Recognizer
- FormRecognizer smartFormRecognizer = new FormRecognizer();
- //Read the input PDF file as stream
- FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite);
- //Recognize the form and get the output as PDF stream
- PdfLoadedDocument pdfLoadedDocument = smartFormRecognizer.RecognizeFormAsPdfDocument(inputStream);
- //Save the loadeddocument
- pdfLoadedDocument.Save(Output.pdf);
- }
-{% endhighlight %}
-{% endtabs %}
-
-
-Example (asynchronous):
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-public async void Button_Click(object sender, RoutedEventArgs e)
- {
- //Initialize the Form Recognizer
- FormRecognizer smartFormRecognizer = new FormRecognizer();
- //Read the input PDF file as stream
- FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite);
- //Recognize the form and get the output as PDF stream
- PdfLoadedDocument pdfLoadedDocument = await smartFormRecognizer.RecognizeFormAsPdfDocumentAsync(inputStream);
- //Save the loadeddocument
- pdfLoadedDocument.Save(Output.pdf);
-}
-
-{% endhighlight %}
-{% endtabs %}
-
-
-## Recognize forms using Stream
-Using `Stream`this operation processes the form content provided through the inputStream whether it contains a PDF or an image and returns the fully recognized PDF as a Stream.This functionality is available through both the synchronous `RecognizeFormAsPdfStream` method for immediate, blocking execution and the asynchronous `RecognizeFormAsPdfStreamAsync` method for non‑blocking, await processing suitable for responsive UI applications, background services, and scalable server‑side workflows.
-
-Example (synchronous):
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-public void Button_Click(object sender, RoutedEventArgs e)
-{
- //Initialize the Form Recognizer
- FormRecognizer smartFormRecognizer = new FormRecognizer();
- //Read the input PDF file as stream
- FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite);
- //Recognize the form and get the output as PDF stream
- Stream outputStream = smartFormRecognizer.RecognizeFormAsPdfStream(inputStream);
- //Save the output PDF stream to file
- using (FileStream fileStream = File.Create("Output.pdf"))
- {
- outputStream.Seek(0, SeekOrigin.Begin);
- outputStream.CopyTo(fileStream);
- }
-}
-{% endhighlight %}
-{% endtabs %}
-
-
-Example (asynchronous):
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-public async void Button_Click(object sender, RoutedEventArgs e)
-{
- //Initialize the Form Recognizer
- FormRecognizer smartFormRecognizer = new FormRecognizer();
- //Read the input PDF file as stream
- FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite);
- //Recognize the form and get the output as PDF stream
- Stream outputStream = await smartFormRecognizer.RecognizeFormAsPdfStreamAsync(inputStream);
- //Save the output PDF stream to file
- using (FileStream fileStream = File.Create("Output.pdf"))
- {
- outputStream.Seek(0, SeekOrigin.Begin);
- outputStream.CopyTo(fileStream);
- }
-}
-
-{% endhighlight %}
-{% endtabs %}
-
-## Recognize forms using JSON
-
-Using `JSON`this operation recognizes the form contained in the inputStream whether the source document is a PDF or an image and returns the complete recognition output serialized as a JSON string. This functionality is accessible through both the synchronous `RecognizeFormAsJson` method, which performs immediate, blocking processing, and the asynchronous `RecognizeFormAsJsonAsync` method, which provides non‑blocking, await execution suitable for UI applications, background workers, and scalable cloud or server‑side workflows.
-
-Example (synchronous):
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-public void Button_Click(object sender, RoutedEventArgs e)
-{
- //Initialize the Form Recognizer
- FormRecognizer smartFormRecognizer = new FormRecognizer();
- //Read the input PDF file as stream
- FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite);
- //Recognize the form and get the output as PDF stream
- String outputJson = smartFormRecognizer.RecognizeFormAsJson(inputStream);
- //Save the outputJson
- File.Create("D:\\result.json").Close();
- File.WriteAllText("D:\\result.json", outputJson);
-}
-{% endhighlight %}
-{% endtabs %}
-
-
-Example (asynchronous):
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-public async void Button_Click(object sender, RoutedEventArgs e)
-{
- //Initialize the Form Recognizer
- FormRecognizer smartFormRecognizer = new FormRecognizer();
- //Read the input PDF file as stream
- FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite);
- //Recognize the form and get the output as PDF stream
- String outputJson = await smartFormRecognizer.RecognizeFormAsJsonAsync(inputStream);
- //Save the outputJson
- File.Create("D:\\result.json").Close();
- File.WriteAllText("D:\\result.json", outputJson);
-
-}
-
-{% endhighlight %}
-{% endtabs %}
-
-## Async variants with CancellationToken
-
-The async overloads accept an optional `CancellationToken` to cancel long running operations. Initially, the cancellationToken uses its default value, and based on our requirements, we can optimize or supply a custom token as needed. This behavior is supported across all async methods, including PDF, JSON, and PdfLoadedDocument asynchronous operations
-
-
-Example with cancellation token (PDF stream):
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-public async Task RecognizeWithCancellationAsync()
-{
- FormRecognizer recognizer = new FormRecognizer();
-
- using FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.Read);
- CancellationTokenSource cts = new CancellationTokenSource();
- cts.CancelAfter(TimeSpan.FromSeconds(5)); // cancel in 5 seconds
- CancellationToken token = cts.Token;
- using Stream resultStream = await recognizer.RecognizeFormAsPdfStreamAsync(inputStream, token);
-
- using FileStream fileStream = File.Create("Output.pdf");
- await resultStream.CopyToAsync(fileStream, token);
-}
-
-{% endhighlight %}
-{% endtabs %}
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/working-with-recognize-option.md b/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/working-with-recognize-option.md
deleted file mode 100644
index b947bed001..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/NET/working-with-recognize-option.md
+++ /dev/null
@@ -1,96 +0,0 @@
----
-title: Using FormRecognizeOptions in SmartFormRecognizer| Syncfusion®
-description: Discover how to configure and apply FormRecognizeOptions in the Syncfusion® SmartFormRecognizer library to enhance form processing workflows.
-platform: document-processing
-control: SmartFormRecognizer
-documentation: UG
----
-
-# Working with FormRecognizeOptions
-
-`FormRecognizeOptions` provides configurable settings that control how the SmartFormRecognizer detects elements from a document. It allows you to enable or disable the detection of specific form controls such as checkboxes, radio buttons, textboxes, and signatures—while also letting you fine tune the recognition results using a confidence threshold.
-Additionally, it supports restricting processing to specific pages through an optional 1‑based inclusive PageRange. By adjusting these options, developers can optimize performance, reduce noise in results, and tailor form extraction precisely to the needs of their application
-
-## Properties
-
-### DetectTextboxes
-`DetectTextboxes` is a boolean property in FormRecognizeOptions that determines whether the form recognizer should detect textbox fields from the document layout. When enabled (default: true), it identifies rectangular input areas such as printed boxes,Rounded boxes, lines, boxes and line inside the table cell, or bordered regions intended for user written text and includes them in the recognition output.
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-FormRecognizer recognizer = new FormRecognizer();
-// Disable textbox detection
-recognizer.FormRecognizeOptions.DetectTextboxes = false;
-
-{% endhighlight %}
-{% endtabs %}
-
-### DetectCheckboxes
-`DetectCheckboxes` is a boolean option in FormRecognizeOptions that controls whether the form recognizer should identify checkbox elements during document processing. When enabled (default: true), the recognizer scans the page layout for checkbox shapes,Rounded square and determines their positions.
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-FormRecognizer recognizer = new FormRecognizer();
-// Disable checkbox detection
-recognizer.FormRecognizeOptions.DetectCheckboxes = false;
-
-{% endhighlight %}
-{% endtabs %}
-
-### DetectRadioButtons
-`DetectRadioButtons` is a boolean property in FormRecognizeOptions that specifies whether the form recognizer should detect radio button elements in the document. When enabled (default: true), the recognizer identifies circular and oval objects in images or in the PDF documents then add radio buttons in that identified locations.
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-FormRecognizer recognizer = new FormRecognizer();
-// Disable radio button detection
-recognizer.FormRecognizeOptions.DetectRadioButtons = false;
-
-{% endhighlight %}
-{% endtabs %}
-
-### DetectSignatures
-`DetectSignatures` is a boolean property in FormRecognizeOptions that controls whether the form recognizer should identify signature fields within a document. When enabled (default: true), the recognizer scans for handwritten style areas, signature lines, or regions typically used for signing, and includes these detected signature bounds in the output.
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-FormRecognizer recognizer = new FormRecognizer();
-// Disable signature detection
-recognizer.FormRecognizeOptions.DetectSignatures = false;
-
-{% endhighlight %}
-{% endtabs %}
-
-### ConfidenceThreshold
-`ConfidenceThreshold` is a double value in FormRecognizeOptions that defines the minimum confidence score (ranging from 0.0 to 1.0) required for any detected form element to be included in the final recognition output. A higher threshold (e.g., 0.9) filters out lower‑certainty detections and ensures only highly reliable results are returned, while a lower threshold increases sensitivity by allowing more detections at the cost of potentially reduced accuracy.
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-FormRecognizer recognizer = new FormRecognizer();
-// Set a ConfidenceThreshold
-recognizer.FormRecognizeOptions.ConfidenceThreshold = 0.9;
-
-{% endhighlight %}
-{% endtabs %}
-
-### PageRange
-`PageRange` is an optional int[,]? property in FormRecognizeOptions that allows you to control exactly which pages of a document the form recognizer should process. Each row in this 2‑dimensional array represents a 1‑based inclusive range in the form [start, end], the recognizer processes all pages in the document. Defining page ranges helps improve performance, reduce unnecessary processing, and target only the sections of the document relevant to your extraction workflow.We can also provide values in single page.Also If we provide values in descending order it will consider as ascending order and perform detection.
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-FormRecognizer recognizer = new FormRecognizer();
-// Set a single page range – detects only the specified page
-recognizer.FormRecognizeOptions.PageRange = new int[,] { { 3 }, { 8 } };
-
-// Set a page range – detects content between the specified start and end page
-recognizer.FormRecognizeOptions.PageRange = new int[,] { { 3, 8 } };
-
-{% endhighlight %}
-{% endtabs %}
-
diff --git a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/overview.md b/Document-Processing/Data-Extraction/Smart-Form-Recognizer/overview.md
deleted file mode 100644
index 1fd0d6eac0..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Form-Recognizer/overview.md
+++ /dev/null
@@ -1,92 +0,0 @@
----
-title: Smart Form Recognizer | Syncfusion®
-description: Learn how to detects form data from PDFs and scanned images using Syncfusion® Essential Studio® SmartFormRecognizer.
-platform: document-processing
-control: SmartFormRecognizer
-documentation: UG
-keywords: Assemblies
----
-
-# Welcome to Syncfusion Smart Form Recognizer Library
-
-Smart Form Recognizer is a deterministic, on premise C# library for .NET designed to reliably detect form data from PDFs and scanned images. Unlike AI‑based approaches, this library uses visual layout heuristics including lines, boxes, and circular markers to identify form structures with high consistency and predictability.It supports to identify the common form controls such as text fields, checkboxes, radio buttons, and signature regions, producing clean, structured JSON that can be fed directly into review and workflow systems.
-
-
-## Core Capabilities
-
-* **Form layout detection**: Locate form regions using graphical heuristics (lines, boxes, circles) for consistent field discovery.
-
-* **Fillable PDF export**: Create a PDF with detected form fields added so documents are immediately usable in form workflows.
-
-* **Page-level control**: Process specific pages or page ranges for targeted extraction.
-
-* **Multi-format support**: Works with PDF, JPEG, PNG and other common image formats.
-
-* **Confidence filtering**: Per-field confidence scores with configurable thresholds to control output quality and drive review logic.
-
-* **Ready for .NET integration**: Deterministic, on premise library that outputs JSON and integrates into existing .NET pipelines and review UIs.
-
-
-NuGet
-
-
-
-Platform(s)
-NuGet Package
-
-
-
-
-Windows Forms
-
-
-{{'[Syncfusion.SmartFormRecognizer.WinForms.nupkg]'| markdownify }}
-
-
-
-
-WPF
-
-
-{{'[Syncfusion.SmartFormRecognizer.WPF.nupkg]'| markdownify }}
-
-
-
-Blazor, .NET Core and .NET Platforms
-
-{{'[Syncfusion.SmartFormRecognizer.NET.nupkg]'| markdownify }}
-
-
-
-
-{{'[Syncfusion.SmartFormRecognizer.Net.Core.nupkg]'| markdownify }}
-
-
-
-
-ASP.NET MVC5
-
-
-{{'[Syncfusion.SmartFormRecognizer.AspNet.MVC5.nupkg]'| markdownify }}
-
-
-
-
-N> The above mentioned NuGet packages are available in [nuget.org](https://www.nuget.org/).
-
-The following code snippet illustrates how to detects form data from PDFs using FormRecognizer method in SmartFormRecognizer.
-
-{% tabs %}
-{% highlight c# tabtitle="C#" %}
-
-//Initialize the Form Recognizer
-FormRecognizer smartFormRecognizer = new FormRecognizer();
-//Read the input PDF file as stream
-FileStream inputStream = new FileStream("Input.pdf", FileMode.Open, FileAccess.ReadWrite);
-//Recognize the form and get the output as PDF stream
-PdfLoadedDocument pdfLoadedDocument = smartFormRecognizer.RecognizeFormAsPdfDocument(inputStream);
-//Save the loadeddocument
-pdfLoadedDocument.Save(Output.pdf);
-
-{% endhighlight %}
-{% endtabs %}
diff --git a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/Assemblies-Required.md b/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/Assemblies-Required.md
deleted file mode 100644
index 7e6565d838..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/Assemblies-Required.md
+++ /dev/null
@@ -1,68 +0,0 @@
----
-title: Assemblies required for SmartTableExtractor | Syncfusion
-description: This section details the Syncfusion assemblies required to configure and run Smart Table Extractor seamlessly in .NET projects.
-platform: document-processing
-control: PDF
-documentation: UG
-keywords: Assemblies
----
-# Assemblies Required to work with Smart Table Extractor
-
-The following assemblies need to be referenced in your application based on the platform.
-
-
-
- Platform(s)
- Assembly
-
-
-
-
-
- {{'WPF'| markdownify }},
- {{'Windows Forms'| markdownify }} and {{'ASP.NET MVC'|
- markdownify }}
-
-
- Syncfusion.Compression.Base
- Syncfusion.ImagePreProcessor.Base
- Syncfusion.OCRProcessor.Base
- Syncfusion.Pdf.Base
- Syncfusion.PdfToImageConverter.Base
- Syncfusion.Markdown
-
-
-
-
- {{'.NET Core'| markdownify }}
- and {{'.NET Platforms'| markdownify }}
-
-
- Syncfusion.Compression.Portable
- Syncfusion.ImagePreProcessor.Portable
- Syncfusion.OCRProcessor.Portable
- Syncfusion.Pdf.Imaging.Portable
- Syncfusion.Pdf.Portable
- Syncfusion.PdfToImageConverter.Portable
- Syncfusion.Markdown
-
-
-
-
- {{'.NET Multi-platform App UI (.NET MAUI)'| markdownify }}
-
-
- Syncfusion.Compression.NET
- Syncfusion.ImagePreProcessor.NET
- Syncfusion.OCRProcessor.NET
- Syncfusion.Pdf.Imaging.NET
- Syncfusion.Pdf.NET
- Syncfusion.PdfToImageConverter.NET
- Syncfusion.Markdown
-
-
-
-
-
-
-
diff --git a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/NuGet-Packages-Required.md b/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/NuGet-Packages-Required.md
deleted file mode 100644
index af443c5a31..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/NuGet-Packages-Required.md
+++ /dev/null
@@ -1,66 +0,0 @@
----
-title: NuGet Packages for Smart Table Extractor| Syncfusion
-description: Learn the NuGet packages required to use Syncfusion® Smart Table Extractor in various platforms and frameworks.
-platform: document-processing
-control: PDF
-documentation: UG
-keywords: Assemblies
----
-# NuGet Packages Required for Smart Table Extractor
-
-## Extract Structured data from PDF
-
-To work with Smart Table Extractor, the following NuGet packages need to be installed in your application from [nuget.org](https://www.nuget.org/).
-
-
-
-
-Platform(s)
-NuGet Package
-
-
-
-
-Windows Forms
-Console Application (Targeting .NET Framework)
-
-
-{{'[Syncfusion.SmartTableExtractor.WinForms.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.WinForms/)'| markdownify }}
-
-
-
-
-WPF
-
-
-{{'[Syncfusion.SmartTableExtractor.Wpf.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.Wpf)'| markdownify }}
-
-
-
-
-ASP.NET MVC5
-
-
-{{'[Syncfusion.SmartTableExtractor.AspNet.Mvc5.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.AspNet.Mvc5)'| markdownify }}
-
-
-
-
-ASP.NET Core (Targeting NET Core)
-Console Application (Targeting .NET Core)
-
-
-{{'[Syncfusion.SmartTableExtractor.Net.Core.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.Net.Core)'| markdownify }}
-
-
-
-
-Windows UI (WinUI)
-.NET Multi-platform App UI (.NET MAUI)
-
-
-{{'[Syncfusion.SmartTableExtractor.NET.nupkg](https://www.nuget.org/packages/Syncfusion.SmartTableExtractor.NET)'| markdownify }}
-
-
-
-
diff --git a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/overview.md b/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/overview.md
deleted file mode 100644
index 24cd27c1c3..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/overview.md
+++ /dev/null
@@ -1,206 +0,0 @@
----
-title: Overview for SmartTableExtractor | Syncfusion
-description: Learn how to detects form data from PDFs and scanned images using Syncfusion® Essential Studio® SmartTableExtractor.
-platform: document-processing
-control: PDF
-documentation: UG
-keywords: Assemblies
----
-
-# Overview of Smart Table Extractor
-
-Syncfusion® Smart Table Extractor is a high accuracy, deterministic C# library that detects and extracts tabular data from PDFs and scanned images for .NET workflows, It detects table regions, header rows, columns, and cell spans (merged cells) and provides per-cell confidence scores and structured exports ready for downstream processing.
-
-## Key Features of Essential® Smart Table Extractor
-
-The following list shows the key features available in the Essential® SmartTableExtractor.
-
-* **Table structure extraction:** Identifies table regions, header rows, columns, row and column spans, and cell boundaries.
-* **File format support:** Works with PDF and common image formats (JPEG, PNG).
-* **Border type handling:** Extract both bordered and border-less tables.
-* **Page-level control:** Extract tables from specific pages or defined page ranges.
-* **Confidence thresholding:** Results are filtered based on a configurable confidence score (0.0–1.0).
-* **Deterministic performance:** Designed for predictable, repeatable extraction across environments (Windows, Linux, Azure, Docker).
-
-## JSON Output Structure and Attributes
-
-The Smart Table Extractor library extracts tabular data from PDFs and scanned images by analyzing table regions, borders, alignment patterns, and cell structures. It returns structured JSON with per-cell confidence scores and complete table hierarchy information.
-
-### Root Structure
-
-Below is the root structure of the JSON result:
-
-{% tabs %}
-
-{% highlight json tabtitle="JSON" %}
-
-{
- "Pages": [
- {
- "PageNumber": 1,
- "Width": 0,
- "Height": 0,
- "PageObjects": []
- }
- ]
-}
-
-{% endhighlight %}
-
-{% endtabs %}
-
-### JSON Attributes
-
-#### Page Object
-
-The Page Object represents the metadata of a page along with the table elements it contains.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-PageNumber
-Integer
-Sequential number of the page in the document.
-
-
-Width
-Float
-Page width in points/pixels.
-
-
-Height
-Float
-Page height in points/pixels.
-
-
-PageObjects
-Array
-List of detected objects (table).
-
-
-
-
-#### PageObjects
-
-PageObjects represent detected table elements on a page.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-Type
-String
-Defines the kind of object detected on the page (Table).
-
-
-Bounds
-Array of Floats
-The bounding box coordinates [X, Y, Width, Height] representing the object's position and size on the page.
-
-
-Content
-String
-Extracted text or value associated with the object (if applicable).
-
-
-Confidence
-Float
-Confidence score (0–1) indicating the accuracy of detection.
-
-
-TableFormat (only for tables)
-Object
-Metadata about table detection, including detection score and label.
-
-
-Rows (only for tables)
-Array
-Collection of row objects that make up the table.
-
-
-
-
-#### Row Object
-
-The Row Object represents a single horizontal group of cells within a table, along with its bounding box.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-Type
-String
-Row type (e.g., tr).
-
-
-Rect
-Array
-Bounding box coordinates for the row.
-
-
-Cells
-Array
-Collection of cell objects contained in the row.
-
-
-
-
-#### Cell Object
-
-The Cell Object represents an individual table entry, containing text values, spanning details, and positional coordinates.
-
-
-
-
-Attribute
-Type
-Description
-
-
-
-
-Type
-String
-Cell type (e.g., td).
-
-
-Rect
-Array
-Bounding box coordinates for the cell.
-
-
-RowSpan / ColSpan
-Integer
-Number of rows or columns spanned by the cell.
-
-
-RowStart / ColStart
-Integer
-Starting row and column index of the cell.
-
-
-Content.Value
-String
-Text content inside the cell.
-
-
-
\ No newline at end of file
diff --git a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/table-extraction-images/onnx-table.png b/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/table-extraction-images/onnx-table.png
deleted file mode 100644
index 4e67e28200..0000000000
Binary files a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/table-extraction-images/onnx-table.png and /dev/null differ
diff --git a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/troubleshooting.md b/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/troubleshooting.md
deleted file mode 100644
index ddc7e74962..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Table-Extractor/NET/troubleshooting.md
+++ /dev/null
@@ -1,99 +0,0 @@
----
-title: Troubleshoot SmartTableExtractor in DataExtraction | Syncfusion
-description: Troubleshooting steps and FAQs for Syncfusion SmartTableExtractor to help users resolve common issues in .NET Framework projects.
-platform: document-processing
-control: SmartTableExtractor
-documentation: UG
----
-
-# Troubleshooting and FAQ for Smart Table Extractor
-
-## ONNX file missing
-
-
-
- Exception
- ONNX files are missing
-
-
- Reason
- The required ONNX model files are not copied into the application’s build output.
-
-
- Solution
-
- Ensure that the runtimes folder is copied properly to the bin folder of the application from the NuGet package location.
-
- Please refer to the below screenshot,
-
-
-
- Note: If you publish your application, ensure the runtimes/models folder and ONNX files are included in the publish output.
-
-
-
-
-## System.TypeInitializationException / FileNotFoundException – Microsoft.ML.ONNXRuntime
-
-
-
- Exception
-
- 1. System.TypeInitializationException
- 2. FileNotFoundException (Microsoft.ML.ONNXRuntime)
-
-
-
- Reason
-
- The required Microsoft.ML.ONNXRuntime NuGet package is not installed in your project.
- SmartTableExtractor depends on this package and its required assemblies to function properly.
-
-
-
- Solution
-
- Install the NuGet package
- Microsoft.ML.ONNXRuntime (Version 1.18.0) manually in your sample/project.
- This package is required for SmartTableExtractor across .NET Framework projects.
-
-
-
-
-## ONNXRuntimeException – Model File Not Found in MVC Project
-
-
-
-Exception
-
-Microsoft.ML.ONNXRuntime.ONNXRuntimeException
-
-
-
-Reason
-
-The required native runtime library (ONNXRuntime.dll) is missing from your application's bin folder.
-
-
-
-
-Solution
-
- In your MVC project file (.csproj), add the following build target to copy the native DLL from the NuGet package folder to the bin folder:
-{% tabs %}
-{% highlight C# %}
-
-
-
-
-
-{% endhighlight %}
-{% endtabs %}
-
-
-
-
-
diff --git a/Document-Processing/Data-Extraction/Smart-Table-Extractor/overview.md b/Document-Processing/Data-Extraction/Smart-Table-Extractor/overview.md
deleted file mode 100644
index 0fcb679de7..0000000000
--- a/Document-Processing/Data-Extraction/Smart-Table-Extractor/overview.md
+++ /dev/null
@@ -1,13 +0,0 @@
----
-title: Intro for smart Table extractor | Syncfusion
-description: This page introduces Syncfusion Smart Table Extractor, explaining its purpose, features, and usage details for .NET applications.
-platform: document-processing
-control: PDF
-documentation: UG
-keywords: Assemblies
----
-
-# Welcome to Syncfusion Smart Table Extractor Library
-
-Syncfusion® Smart Table Extractor is a high accuracy, deterministic C# library that detects and extracts tabular data from PDFs and scanned images for .NET workflows, It detects table regions, header rows, columns, and cell spans (merged cells) and provides per-cell confidence scores and structured exports ready for downstream processing.
-
diff --git a/Document-Processing/Data-Extraction/overview.md b/Document-Processing/Data-Extraction/overview.md
index 6dcdc8c0af..1f39afbb64 100644
--- a/Document-Processing/Data-Extraction/overview.md
+++ b/Document-Processing/Data-Extraction/overview.md
@@ -1,6 +1,6 @@
---
title: Extract structured data from PDF & image Files in .NET | Syncfusion
-description: Syncfusion® Smart Data Extractor is a .NET library that extracts tables, forms, text, and images from documents, producing PDFs and JSON.
+description: Syncfusion® Smart Data Extractor is a .NET library that extracts tables, forms, text, and images from documents, outputing PDFs and JSON.
platform: document-processing
control: SmartDataExtractor
documentation: UG
@@ -9,12 +9,16 @@ keywords: Assemblies
# Welcome to Syncfusion Data Extraction Library
-Syncfusion® *Smart Data Extractor* is a high performance, deterministic C# library that extracts complete document structures from PDFs and images.
+Syncfusion® **Smart Data Extractor** is a high‑performance, deterministic C# library for extracting complete document structures from PDFs and images.
## List of Data Extraction Libraries
-* SmartDataExtractor - analyzes visual layout lines, boxes, labels, and alignment to locate and extract elements such as table structure, text elements, images, headers, footers, and form fields with per-field confidence scores for immediate review, export, or integration.
-* SmartTableExtractor - detects table regions, header rows, columns, and cell spans (merged cells) and provides per-cell confidence scores and structured exports ready for downstream processing.
-* SmartFormRecognizer - analyzes visual layout cues—lines, boxes and circles—it locates form regions and extracts common controls (text fields, checkboxes, radio buttons, signatures), producing clean JSON with per-field confidence scores. Where form fields are detected, the library can also produce a fillable PDF with corresponding form fields added for immediate use.
+* **Smart Data Extractor** - analyzes visual layout lines, boxes, labels, and alignment to identify and extract elements such as tables, text blocks, images, headers, footers, and form fields. Each element is returned with per‑field confidence scores for immediate review, export, or integration.
+* **Smart Table Extractor** - detects table regions, header rows, columns, and merged cells (cell spans). Provides per‑cell confidence scores and delivers structured exports ready for downstream processing.
+* **Smart Form Recognizer** - analyzes layout cues such as lines, boxes, and circles to detect form regions. It extracts common controls including text fields, checkboxes, radio buttons, and signature fields, producing clean JSON output with confidence scores. When form fields are identified, the library can also generate a fillable PDF for immediate use.
+
+* **Optical character recognition (OCR)** - a high‑performance .NET library for accurate text recognition from scanned documents, images, and PDF files. It processes raster images and document pages to recognize printed text, analyze page layouts, and extract textual content programmatically.
+
+* **Conversion** – extracts data from PDFs or images and produces output in developer‑friendly formats such as JSON and Markdown (MD), enabling seamless integration into applications.