
Scan pdf and read tables with OpenCV & Tesseract OCR

$250-750 USD

Completed
Posted about 5 years ago

Paid on delivery
Project Mission:
Search for and find all tables in a PDF, and convert images of tables from PDF (or other sources) to CSV-formatted tables. The input is mainly PDF, but the solution must use OCR because not all PDFs are formatted for parsing. It must be able to handle tables that span two pages (see standard bank report pg 12/13).

Requirements:
- OpenCV (Python)
- Tesseract v4

A set of images of PDFs will be provided. It's important not to optimize the solution for these specific tables. The solution must be generic and will be tested against other images of tables. It is a priority to handle regular tables with high precision. Pie charts and similar diagrams are a bonus.

Proposed steps:
1. Analyze images using OpenCV to determine table cells (rows and columns).
2. Slice the input image into multiple images based on the cells.
3. Use Tesseract 4 to OCR text from each cell.
4. Output data to CSV.

Expected outcome:
- Conversion is at least 95% accurate with our test set (standard tables, not provided, to avoid overfitting).
- Docker image with all dependencies provided.
- Function / script / API that takes an image and outputs a CSV table.

Readings / Links:
- Improving quality: [login to view URL]
- Finding text blocks in an image using OpenCV: [login to view URL]
- Table analysis using histograms: [login to view URL]
- Docker OpenCV image: [login to view URL]

Attached files: PDFs to convert
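
To illustrate the last stages of the proposed steps (arranging OCR'd cells into rows and writing CSV), here is a minimal sketch, not the awarded solution. It assumes the cell bounding boxes have already been found, for example with OpenCV contour detection and `cv2.boundingRect`, and that OCR is supplied as a callable, for example a wrapper around `pytesseract.image_to_string` applied to the cropped cell. The names `Box`, `group_into_rows`, `cells_to_csv`, and the `y_tolerance` default are hypothetical and chosen only for this example.

```python
import csv
import io
from typing import Callable, List, Tuple

# Hypothetical cell representation: (x, y, width, height) in pixels.
Box = Tuple[int, int, int, int]


def group_into_rows(boxes: List[Box], y_tolerance: int = 10) -> List[List[Box]]:
    """Group cell boxes into rows by their top edge, then sort each row left to right."""
    rows: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: b[1]):
        # A box joins the current row if its top edge is close to that of the
        # row's first box; otherwise it starts a new row.
        if rows and abs(box[1] - rows[-1][0][1]) <= y_tolerance:
            rows[-1].append(box)
        else:
            rows.append([box])
    return [sorted(row, key=lambda b: b[0]) for row in rows]


def cells_to_csv(boxes: List[Box], ocr_cell: Callable[[Box], str]) -> str:
    """Run ocr_cell on every box and return the table as CSV text."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for row in group_into_rows(boxes):
        writer.writerow([ocr_cell(box).strip() for box in row])
    return out.getvalue()


if __name__ == "__main__":
    # Stand-in for real OCR: a lookup from box to text (made-up sample data).
    fake_text = {
        (10, 50, 100, 30): "Item",
        (120, 52, 100, 30): "Amount",
        (10, 90, 100, 30): "Fees",
        (120, 91, 100, 30): "1,250",
    }
    print(cells_to_csv(list(fake_text), fake_text.__getitem__))
```

Passing the OCR step in as a callable keeps the row-grouping and CSV logic testable without Tesseract installed. The tolerance on the top edge is one simple way to absorb small skew in scanned pages, and a real solution would likely need something more robust for merged or multi-line cells.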
Project ID: 18574010

About the project

6 proposals
Remote project
Active 5 years ago

Awarded to:
Hello sir. I have worked with web technologies for 6 years and have studied machine learning for 2 years. I have worked on OCR and OpenCV projects for reading images and for emotion detection. I am ready to start your job. Please ping me if you are interested.
$500 USD in 10 days
4.3 (11 reviews)
3.6
6 freelancers are bidding an average of $1,101 USD for this job
Hi there, I've read your project description and I am confident that I can handle this project according to your expectations. I have done similar projects before and I would like to take on this project as well. If you're interested, please contact me to see my portfolio :) I'll be waiting for your response. Regards
$550 USD in 18 days
5.0 (9 reviews)
5.5
Hi there. Roaya is a startup based in Egypt, and we are an official Odoo partner. We are ready to start working on your project. Please let us discuss the details. Regards, Mohammad Alaa
$2,000 USD in 20 days
4.9 (19 reviews)
5.0
I have already worked on OCR with PDFs and on extracting text from them, so I can do your job within the time limit and to your satisfaction.
$1,888 USD in 30 days
4.1 (2 reviews)
2.9
Hi, dear. I am very interested in your project, 'Scan pdf and read tables with OpenCV & Tesseract OCR'. I've already done this kind of project before. I'm a professional programmer with 12 years of experience. If you award me the project, I'll implement all of your requirements in a short time. Skills: Java, Machine Learning, Python, Software Development
$555 USD in 3 days
0.0 (0 reviews)
0.0
I'm a senior software developer with a very high personal standard for code quality, and I pay attention to detail. I have been programming full-time for more than 10 years. Some of my experience is summarized below:
➢ Java 7 & 8 (6+ years experience)
  ▪ Android, Java EE (J2EE), J2ME, JSF, JSP, PhoneGap
  ▪ Gradle, Maven, Ant
  ▪ Spring, Hibernate, MyBatis, EJB
  ▪ Jboss/Wildfly, Tomcat, Weblogic
  ▪ TestNG, JUnit, Mockito
  ▪ Swagger, Dropwizard, JAXB, Axis2
➢ C# (.NET Core + Standard + Framework)
  ▪ Dapper & Entity Framework
  ▪ NUnit
➢ SQL (10+ years experience)
  ▪ MySQL, MSSQL, Stored Procedures
➢ Oracle (+- 1 year experience)
  ▪ PL SQL, Stored Procedures
➢ HTML (+HTML 5, 10+ years experience)
  ▪ JSON, JavaScript, CSS, AJAX, XML, YAML
➢ PHP (10+ years experience)
➢ C++ (3+ years experience)
➢ Pure C (2+ years experience)
➢ Cisco IOS (2+ years experience)
➢ Perl (2+ years experience)
➢ SH (10+ years experience)
➢ BASH (10+ years experience)
➢ Clarion (version 8 & version 10)
➢ Python
➢ VB (.NET)
➢ Delphi
➢ Assembly
I am very proficient with Linux/Unix, which I have used for more than 10 years with KDE, Gnome, Fluxbox and pure terminal. Flavours I have used include:
➢ Gentoo
➢ CentOS
➢ Debian
➢ Mint
➢ Kali + Backtrack 2 & 3
➢ RedHat
➢ (K)Ubuntu
➢ FreeBSD (UNIX)
➢ Knoppix
➢ Arch
➢ PHLAK
➢ OpenSUSE
➢ Fedora
➢ PCLinuxOS
among many others
$1,111 USD in 20 days
0.0 (0 reviews)
0.0

About this client

Sandton, South Africa
0.0
0
Payment method verified
Member since Jan 22, 2019

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)