AI-powered solution for
media content analysis
March 2024-ongoing
3 experts
Business Intelligence
USA
Summary
Business challenge:
Developing data-driven reputation management tools using evolutionary biology and visualization, with a focus on accurately analyzing media content.
Zoolatech approach:
We developed a proprietary AI-based algorithm for content analysis capable of summarizing, categorizing, and deduplicating content. We also aligned article categorization with metric changes to enable metric correlation. Additionally, we implemented automated reporting with AI-generated reports and recommendations based on metrics and article excerpts. Our tech stack includes Amazon Bedrock, Anthropic Claude, Amazon Titan, Langchain.js, and PostgreSQL with pgvector.
Value delivered:
  • Improved article relevance and text generation for personalized customer dashboards.
  • Enabled understandable and relevant reports.
  • Increased cost-efficiency.
Technologies:
Facing similar challenges? Contact our experts now.
About the Client

The client is a company specializing in data science and reputation management. They have developed an innovative product that integrates academic evolutionary biology and automated data visualization to improve the speed and accuracy of reputation management.

Business Challenge

An essential component of the product is a set of metrics and algorithms for calculating the numerical characteristics of the client’s reputation. Some reputation indicators rely on analyzing media content associated with the company. The client aims to provide its clients with monthly reports on reputation changes, detailed explanations, and actionable advice.

However, the media content supplied by a third party was often generalized, making it difficult to analyze and categorize. The generalized content was often irrelevant or lacked the context needed to link the article to the specific company or its reputation.

Zoolatech Approach

Before creating the report, we identified several problems related to the quality of the supplied content and its duplication. To address these issues, we:

  • Developed Proprietary Algorithms: We stopped using third-party articles and developed our own AI-based summarization and categorization algorithm.
  • Correlated Metrics with Article Categories: We used article categorization to align with changes in relevant metrics.
  • AI-Generated Reports: We used AI to generate reports and recommendations based on the provided set of metrics and excerpts from relevant articles.
  • Vectorization for Duplication Detection: We employed vectorization of article text to identify duplications or reposts before including them in the report.
media-genai2-pic1-0-669f9741
Implementation

Our solution utilized the Amazon Bedrock platform with the following foundation models:

  • Anthropic Claude: For report generation.
  • Amazon Titan: For summarization, categorization, and vectorization.

We used the Langchain.js framework running under Node.js, which decoupled the main logic from specific APIs and simplified work with various aspects of LLMs. Vector storage was implemented within the application’s main DBMS (PostgreSQL), utilizing the pgvector plugin.

Value Delivered
  • Improved Relevance: Switching to a proprietary algorithm enhanced article relevance and text generation for personalized customer dashboards.
  • Comprehensive Reports: Correlating changes in metrics with article categories and using deduplicated texts resulted in understandable and relevant reports.
  • Cost-Effectiveness: Leveraging multiple models on Amazon Bedrock with Langchain.js reduced overall solution costs.
media-graph2-3-0-669f9b21
Contact us
Let's build great
products together!