#

metadata-extraction

Here are 472 public repositories matching this topic...

kreuzberg

kreuzberg-dev / kreuzberg

A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 50+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, TypeScript (Node/Bun/Wasm/Deno) —or use via CLI, REST API, or MCP server.

ruby python java rust golang php node elixir ffi wasm tesseract text-extraction metadata-extraction table-extraction pdfium rag pdf-extraction document-intelligence

Updated Jan 11, 2026
HTML

schemacrawler / SchemaCrawler

Free database schema discovery and comprehension tool

java documentation schema database jdbc reverse-engineering driver schemaspy database-schema schemacrawler database-diagrams metadata-extraction er-diagram entity-relationship-diagram database-document database-documentation e-r-diagram

Updated Jan 11, 2026
HTML

deepjyoti30 / ytmdl-web-v2

Web version of ytmdl. Allows downloading songs with metadata embedded from various sources like itunes, gaana, LastFM etc.

audio metadata spotify youtube download itunes songs webapp music-download online-music metadata-extraction no-ads freesoftware freemusic high-quality-music free-music ytmdl audio-extraction free-music-download

Updated Jan 16, 2024
Vue

tern-tools / tern

Tern is a software composition analysis tool and Python library that generates a Software Bill of Materials for container images and Dockerfiles. The SBOM that Tern generates will give you a layer-by-layer view of what's inside your container in a variety of formats including human-readable, JSON, HTML, SPDX and more.

python docker open-source tool containers compliance dependencies spdx metadata-extraction risk-management software-composition-analysis oss-compliance sbom supply-chain-security

Updated Mar 12, 2024
Python

photostructure / exiftool-vendored.js

Fast, cross-platform Node.js access to ExifTool

nodejs metadata photos image movies video cross-platform photography images gps exif videos photo exiftool photographs metadata-extraction

Updated Jan 9, 2026
TypeScript

CeON / CERMINE

Content ExtRactor and MINEr

java pdf machine-learning metadata-extraction reference-parsing affiliation-parsing

Updated Jun 30, 2022
Java

m8sec / pymeta

Utility to download and extract document metadata from an organization. This technique can be used to identify: domains, usernames, software/version numbers and naming conventions.

metadata python3 pentest metadata-extraction pentest-tool extract-metadata information-disclosure

Updated Jun 19, 2024
Python

aydinnyunus / exifLooter

ExifLooter finds geolocation on all image urls and directories also integrates with OpenStreetMap

golang metadata security image osint hack hacking exif bug-bounty bugbounty exiftool cyber-security metadata-extraction exif-metadata redteam

Updated Jul 14, 2024
Go

MartinStyk / AndroidApkAnalyzer

Android application for analyzing installed apps

master-thesis apk android-application analyzer metadata-extraction androidmanifest

Updated Jun 1, 2024
Kotlin

photostructure-for-servers

photostructure / photostructure-for-servers

PhotoStructure for Servers

macos linux metadata photos video ubuntu photography jpeg image-processing photo-browser image-viewer photo metadata-extraction metadata-management gallery-images raw-image photostructure

Updated Sep 11, 2024
Shell

MK-Ware / Forensic-Tools

A collection of tools for forensic analysis

Updated Sep 12, 2019
Python

ExifGlass

d2phap / ExifGlass

📷 EXIF metadata viewing tool

dotnet exif avalonia avaloniaui exif-data-extraction exiftool metadata-extraction exif-reader exif-metadata

Updated Nov 2, 2025
C#

adultmm / AdultMediaManager

Adult Media Manager is the ultimate media manager for your adult movies and videos. Organize your content for Kodi, Plex, and other media centers.

metadata-extraction media-manager adult-contents

Updated Jul 22, 2025

TRACE-Forensic-Toolkit

Gadzhovski / TRACE-Forensic-Toolkit

Digital forensic analysis tool that provides a user-friendly interface for investigating disk images.

Updated Nov 12, 2025
Python

wikimedia / html-metadata

MetaData html scraper and parser for Node.js (supports Promises only)

nodejs javascript web-scraper web-scraping node-module metadata-extraction metadata-extractor

Updated Oct 16, 2025
JavaScript

OpenGraph

shweshi / OpenGraph

A Laravel package to fetch Open Graph data of a website.

metadata laravel laravel-package opengraph opengraph-tags hacktoberfest metadata-extraction opengraph-data laravel-opengraph

Updated Nov 14, 2025
PHP

mauricelambert / SpyWare

This package implements a complete SpyWare.

screenshots clipboard python3 recorder spyware keylogger connections metadata-extraction webcam-capture pypi-packages

Updated Nov 18, 2024
Python

adbar / htmldate

Fast and robust date extraction from web pages, with Python or on the command-line

nlp metadata natural-language-processing datetime date information-extraction web-scraping opengraph digital-forensics webscraping metadata-extraction date-parser entity-extraction forensics-tools

Updated Nov 4, 2025
Python

grisuno / LazyOwn

LazyOwn RedTeam/APT Framework is the first RedTeam Framework with an AI-powered C&C, featuring rootkits to conceal campaigns, undetectable malleable implants compatible with Windows/Linux/Mac OSX, and self-configuring backdoors. With its Web interface and powerful Console Client, it is the best combination for your RedTeam/APT campaigns.

Updated Nov 30, 2025
Python

BetaHuhn / metadata-scraper

🏷️ A JavaScript library for scraping/parsing metadata from a web page.

metadata parser typescript open-graph javascript-library page meta-tags metatags metadata-extraction html-scraper

Updated Jan 5, 2026
TypeScript

Improve this page

Add a description, image, and links to the metadata-extraction topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the metadata-extraction topic, visit your repo's landing page and select "manage topics."