Skip to content
View TFMV's full-sized avatar

Highlights

  • Pro

Organizations

@Veloce-Data-Solutions

Block or report TFMV

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
TFMV/README.md

Typing SVG

GitHub Stats

repos-per-language most-commit-language

profile-summary-cards

About πŸ§‘β€πŸ’»

Welcome to the GitHub repository of Thomas F McGeehan V, a seasoned Data Technology Architect with a rich portfolio spanning over two decades in the field of data engineering and analytics. I hold nearly a dozen patents and have led several high-impact projects across various industries, demonstrating a consistent commitment to excellence and innovation.

Expertise 🌟

Expertise Description
πŸ—οΈ Data Architecture & Engineering Designing and implementing resilient, performant, and scalable data platforms that cover all phases of data lifecycle management, from ingestion and integration to consumption and analytics.
πŸ€– Machine Learning & AI Democratizing machine learning applications, making advanced analytics accessible in innovative ways.
☁️ Cloud Solutions Extensive experience with major cloud platforms, including both public clouds and on-premise solutions.

Current Projects πŸ”₯

Project Description
βœ–οΈ BigQuery BigFunctions As an active contributor to the open-source BigFunctions project, I help develop advanced SQL functions that extend the capabilities of Google BigQuery, enabling more efficient and powerful data transformations and analyses.
πŸ” AddressMatchPro A Go solution for approximate entity matching, focusing on standardizing street addresses in the USA. This project utilizes advanced algorithms to ensure high accuracy and efficiency in entity resolution tasks.
🧠 PromptTriad An innovative Go API hosted on Cloud Run that leverages three competing AI models (OpenAI, Gemini, and Cohere) to collaboratively engineer and optimize the best possible prompt from any given input. The project focuses on integrating these APIs, implementing response evaluation using cosine similarity, and providing robust logging and monitoring.
🐘 GCS2Postgres A Go-based solution designed to load various open data formats stored in Google Cloud Storage (GCS) and BigQuery into a PostgreSQL database. It supports multiple file formats, utilizes BigQuery for data processing, and ensures secure PostgreSQL credentials retrieval from Google Secret Manager.
πŸ‘“ LinguisticLens An API for analyzing text using OpenAI's language model, focusing on emotional, factual, and implicit aspects. It identifies and explores dark triad traits, hidden meanings, and tonal nuances to provide a comprehensive text analysis. Built using the Gin framework.
πŸ€– BQ Multi Agent A platform leveraging multiple AI agents to interact with Google BigQuery for enhanced data analytics. This project aims to optimize query performance and provide insightful data analytics through a multi-agent architecture.
🏹 ArrowLake A data lakehouse architecture integrating Apache Arrow and Iceberg to optimize large-scale data processing, analytics, and real-time streaming. This project includes vector database integration using pgvector for GenAI, LLMs, and transformer architectures, and leverages Storj for decentralized, secure, and scalable cloud storage.
πŸ›©οΈ Flight A Go implementation of the Apache Arrow Flight SQL protocol. This project enables efficient, high-performance data transport using Arrow Flight, facilitating interoperability and enhancing the data processing capabilities of modern data systems.

Core Values βœ…

Value Description
πŸ’‘ Innovation Continuously pushing the boundaries of technology to create solutions that not only meet current needs but also foresee and address future challenges.
πŸ† Leadership Building and nurturing teams that are not only technically proficient but also innovative and forward-thinking.
⭐ Excellence Consistently striving to exceed expectations through high-quality work and persistent dedication to improving and evolving in all aspects of technology and leadership.

Contact πŸ“¬

Connect with me to discuss potential collaborations, or if you’re looking for guidance or mentorship in data technology and architecture:

Links πŸ”—

Pinned Loading

  1. AddressMatchPro AddressMatchPro Public

    AddressMatchPro is a sophisticated address matching and deduplication tool designed to handle large datasets with high precision. Leveraging advanced algorithms, it ensures accurate matching of cus…

    Go

  2. GCS2Postgres GCS2Postgres Public

    A Go-based solution designed to load various open data formats stored in Google Cloud Storage (GCS) and BigQuery into a PostgreSQL database.

    Go

  3. PromptTriad PromptTriad Public

    A Go API hosted on Cloud Run that leverages three competing AI models to collaboratively engineer and optimize the best possible prompt from any given input.

    Go

  4. FrostyBridge FrostyBridge Public

    FrostyBridge is a Python solution designed to export entire PostgreSQL databases to various storage systems in open data formats including Iceberg.

    Python 1

  5. FractPunk FractPunk Public

    FractPunk generates stunning fractal images with vibrant colors, random shapes, and whimsical phrases. Experience a unique blend of mathematical beauty and creative flair in every piece

    Go