Filter

My recent searches
Filter by:
Budget
to
to
to
Type
Skills
Languages
    Job State
    216 multimodal jobs found

    ...discuss the specific details of the model and datasets further, along with your proposed approach. Requirements: Proven experience in fine-tuning and customizing ML models. Strong understanding of machine learning framework PyTorch. Ability to implement novel changes to ML models CNN/transformers. Please provide examples of relevant past work or projects you've completed. Task 0 Hierarchical multimodal transformers for Multi-Page DocVQA GRAM: Global Reasoning for Multi-Page VQA image newspaper dataset(will share link later) Task 1 Till now when we give input embeddings to the transformers they just go parallel they don't relate with each other. We want that input embeddings for the transformers should be learnable and related to each other and then go as input to ...

    $104 Average bid
    $104 Avg Bid
    11 bids

    ...Hugging Face, etc.). - Strong background in model training, fine-tuning, and real-world implementation. - Passion for mentoring and a clear communicator who can break down technical concepts. - Ability to customize learning experiences based on my current understanding and goals. Additional Skills (Preferred but not Required): - Experience with text-to-image models, language models, or multimodal AI applications. - Familiarity with AI ethics and best practices in the deployment of generative models. Project Duration: Flexible – The goal is to create a continuous learning experience, with the project lasting from a few weeks to several months depending on the scope and progression. If you have a passion for mentoring others and a deep understanding of generativ...

    $12 / hr Average bid
    $12 / hr Avg Bid
    6 bids

    ...source code (the backend is encrypted). Please help us replicate all the functionalities of the website and admin panel, and additionally develop two mobile frontends and two desktop frontends (iOS, Android, Mac, Windows). Add new features and optimize AI functionality: Including enhancements to the existing AI Chat feature (adding a canvas feature, search functionality, memory functionality, multimodal capabilities, document export feature, real-time voice interaction, voice-to-text, and memory functionality). For AI Drawing, add image editing, image extension, selective redrawing, background removal, and a canvas for image design. For AI Music, include one-click music generation, partial modification of generated music, text-to-lyrics matching, and adding our watermark logo. ...

    $7372 Average bid
    $7372 Avg Bid
    82 bids

    I'm looking for a professional who can design and implement a comprehensive anomaly detection solution for IoT devices. This multimodal security solution will integrate network traffic analysis with other data sources, specifically video data. The goal is to reinforce security against potential threats, so a strong focus on intrusion attempts and malware activities is essential.

    $199 Average bid
    $199 Avg Bid
    20 bids

    ...on enhancing my personal growth. Ideal Skills and Experience: - Expertise in personal development coaching - Ability to create custom growth plans - Strong communication and motivational skills - Experience in tracking progress and adapting plans Please note, I have not yet defined a specific area of focus within personal development, so flexibility and a broad skill set are key. Craft your Multimodal Project Proposal: You will have to write a Proposal that explains what each of the 5 multimedia texts be, what platforms/apps/tech you will use to craft them, and how these new texts will expand your reach beyond the audience of ENGL 1001 and your instructor, what purpose each multimedia text will serve, and how these 5 multimedia texts in total will be equivalent to the length/de...

    $80 Average bid
    $80 Avg Bid
    15 bids

    ...seeking a deep learning specialist with a focus on natural language processing. The project involves creating a multimodal emotional recognition system using text, speech, and images, primarily for healthcare analysis. Key Areas of Focus: - Deep Learning: Expertise in natural language processing is crucial. - Multimodal Emotional Recognition: Experience with text, speech, and image processing. - Healthcare Analysis: Understanding of healthcare dynamics and requirements will be a plus. Ideal Skills and Experience: - Proven track record in deep learning projects. - Prior experience with emotional recognition systems. - Strong understanding of natural language processing. - Multimodal data processing skills (text, speech, image). - Experience in healthcare-related AI...

    $341 Average bid
    $341 Avg Bid
    8 bids

    ...Ability to handle and source historical photos. Please incorporate a jungle style throughout the photo essay. Also, here some instructions for how to cite photos in the photo essay as well: In a traditional print essay, we would cite images by labeling images as Fig. # with a description of the image and then include a Works Cited entry for the image (or table or other non-text element). For a multimodal essay, you have more options. What's most important is that if the image is not a free stock image, you give credit to the author for the work.  Here are some options: 1. Some sites, such as Creative Commons and Wikimedia, include the citation information with the image. Use that citation when available. Copy the citation and add under the image. For example, an im...

    $103 Average bid
    $103 Avg Bid
    55 bids

    Please read the project description carefully before applying Hello everyone, I'm seeking a developer to work with on setting up a Multimodal RAG (Retrieval-Augmented Generation) server using LangChain, with SharePoint integration for real-time data access. This project should include thorough documentation, as my goal is to understand the process so I can do it myself. The Multimodal RAG server should support reading and storing PDFs and DOCX files (including embedded images) for future use. Here’s a helpful guide for multimodal RAG with LangChain: @shravankoninti/multimodal-rag-with-gpt-4-vision-and-langchain-60a6a13a92e4

    $47 Average bid
    $47 Avg Bid
    45 bids

    Development of a Comprehensive Multimodal AI Website and Software Frontend and Management Panel UI: Designed using Flutter. Backend and Functionalities: Developed using PHP. Platforms: Web version and mobile version (iOS, Android). Requirements: Ensure all code is developed from scratch without using licensed, encrypted, harmful, or unlicensed secondary development code. Upon completion, ensure the deployment is successful, and provide downloadable apps for mobile platforms. Languages Supported: Chinese and English. Figma UI: We are working on the Figma UI. Your role is to develop the features and integrate them with the UI. Core AI Features Area: Three-Column Layout: All function pages use a three-column layout (sidebar, dialogue area, canvas/function debugging area). History M...

    $2208 Average bid
    $2208 Avg Bid
    142 bids

    Hello I'm a python developer, I need help from a freelancer to help me install rag server with langchain I found the following guides : Thank you

    $39 Average bid
    $39 Avg Bid
    16 bids

    ...platforms for API integration. 8.13 Account Security System • Two-level Protection Feature: Provide two-level protection to prevent account theft. • View and Delete Logged-in Devices: Ensure account security by allowing users to view and delete logged-in devices. 8.14 Chatbot Function • External Plugin Module: Support integration of external APIs (e.g., search engine APIs) for fine-tuning and support multimodal functionality to enhance chatbot interaction. 8.15 Invitation Feature • Users can share the platform, invite new users to register, and undergo real-name verification to receive corresponding credit rewards. If the invited user recharges, a proportional rebate is provided. 8.16 Comprehensive Back-end Management System • Back-end Management Features...

    $350 Average bid
    $350 Avg Bid
    53 bids

    I'm seeking an expert in multimodal deep learning to create a recommendation system specifically for E-commerce. Key requirements: - Experience in developing recommendation systems - Proficiency in multimodal deep learning techniques - Understanding of E-commerce dynamics and needs Please note that the specific modalities (Audio, Visual, Text) to be used in the recommendation system have yet to be determined. Therefore, flexibility and creativity in proposing suitable modalities is a plus.

    $81 Average bid
    $81 Avg Bid
    20 bids

    Freelancer Project Requirement for AI Projects Project Overview: We seek a highly skilled AI developer or team to collaborate on a series of AI-based projects. These projects involve developing and implementing advanced AI models, focusing on geospatial-climate data analysis, time series forecasting, and multimodal learning. The work will be done on a project-by-project basis, with payment and timelines to be decided after initial discussions of the project scope. Required Skills and Expertise: PyTorch Lightning Framework: Proficient in using PyTorch Lightning for streamlined model training and experimentation. Experience in managing complex model training processes and scaling across multiple GPUs. Transformer Models: Deep understanding and experience with transformer architec...

    $65 Average bid
    $65 Avg Bid
    5 bids

    ...addition, the project will involve working with various APIs and services to enhance functionality and user experience. Key Requirements: - The ability to develop chrome extension app - The web extension will work and interact with interactive map and GIS data - Dealing with ChatGPT, Gimini or any other LLM with API - Work with information retreaval and webscraping and summarization - Work with multimodal generative AI models - with third party services like Zapier or twilio - Work with cloud storage and databases Ideal Skills: - Proficiency in Chrome extension development. - Experience working with GIS data and interactive maps. - Familiarity with ChatGPT, Gimini or any other LLM with API - Ability to create user-friendly interfaces. Please provide examples of prev...

    $1113 Average bid
    $1113 Avg Bid
    64 bids

    ...and date). The agent should extract key order details from the HTML files, such as order number, product names, quantities, prices, and delivery status, and store them in a structured format (e.g., JSON or CSV). The agent should make a decision to stop the process once all of the orders are fetched successfully. Tips and Guidelines The solution should leverage Large Language Models (LLMs) or Multimodal Language Models for natural language understanding, decision-making, and navigation. While other approaches can be combined with LLMs, the primary focus should be on utilizing LLMs effectively. The agent should handle user authentication securely, ensuring that user credentials are protected and the authentication mechanism is reliable. The agent should understand and interact with...

    $152 Average bid
    $152 Avg Bid
    26 bids

    I'm looking for an experienced freelancer to aid in creating engaging written content, visually appealing graphic designs, and captivating videos. This triple-threat approach is to foster awareness and generate support for a current campaign aimed at updating supporters, garnering attention and increasing donations towards a current and future legal case. Key tasks: - Crafting relatable and impactful written content - Designing striking graphics to grab attention - Producing dynamic, emotionally resonant videos Skills you will need: - Trustworthy - Integrity - Excellent writing ability for diverse audiences - Strong graphic design skills with a knack for visual storytelling - Proficiency in video creation, from storyboard to editing Evidence of successful similar past projects wil...

    $19 / hr Average bid
    NDA
    $19 / hr Avg Bid
    25 bids

    I need a 3D AI deskmate, resembling a humanoid, that can assist me with web browsing and file management. The AI should be able to see my desktop, browse the web, download files, and interact with the file explorer. Key Features: - Multimodal capabilities with vision and speech - 3D humanoid model - Interactive and engaging I already have the 3d model by the way. The AI's voice should be inspired by Lain from the anime 'Serial Experiments Lain' - so it should have a somewhat calm, yet engaging tone. Ideally, the freelancer for this project should have experience in: - AI development - 3D modeling - Voice modulation and synthesis - Creating interactive software on Desktop Please provide examples of similar projects you have worked on in your proposal.

    $32 Average bid
    $32 Avg Bid
    11 bids

    I am in need of an efficient and security-minded individual who can create a user-friendly digital platform for the purpose of marketing and selling digital art and music clips. Key Features include: - Provision for customers to purchase and download Images, Music, and Videos - Secure payment integration to ensure customer confidentiality and data protection - Immediate download feature upon completion of purchase - Preview option for customers to view the art or listen to the music clips before making purchases - Brief descriptions of each product to provide an overview of their value and appeal Ideal candidate should have: - Experience in creating secure ecommerce platforms - Strong understanding of UX/UI to develop a user-friendly storefront - Prior experience or knowledge in dealin...

    $257 Average bid
    $257 Avg Bid
    67 bids

    Role & responsibilities Fully conversant with international shipping Incoterms evolving towards an integrated, multimodal, door-to-door logistics approach. Has dealt with all the major shipping lines - MSC, CMA- CGM, Hapag Llyod, Maersk etc. as well as freight forwarders and nomination agents. Freight negotiation skill, Maintaining freight data, Freight Bill passing & dispute resolution. Handling Ex- works, FOB. C & F , CIF , DAP, DDU, DDP shipments in its full lifecycle. knowledge including ISF filing, shipping bill amendments, transmission of shipping bill on DGFT portal, EGM filings Order processing through WEB EDI., critical to us and hands on experience of monitoring, and resolving the issues is important. Domestic supply chain, LR Copy, Create Manage E-Way Bill...

    $7 / hr Average bid
    $7 / hr Avg Bid
    16 bids

    I am seeking a proficient developer experienced with Python, Computer Vision, and Machine Learning to develop a code primarily for Pathological image understanding in a multimodal setting. Required Skills and Experience: - Proficient in the use of Python for coding. - Strong familiarity with Computer Vision and Machine Learning techniques. - Experience in developing medical image codes. - Understanding of proper optimization for personal computer-based machine learning processes. Working together, we will strive to create an efficient and reliable object recognition system. A preference will be given to those who can clearly demonstrate past achievements in similar projects.

    $8 / hr Average bid
    $8 / hr Avg Bid
    28 bids

    I'm in need of a skillful freelancer who can create an engaging, enlightening and compelling visual multimodal presentation that primarily focuses on the importance of teaching writing to English-Language Learner (ELL) students. The presentation should cover these key aspects: - Strategies for vocabulary development - Techniques for grammar instruction - Approaches for improving writing fluency Ideal candidates should have a background in ELL education, curriculum design or linguistics. The presentation must include three of the following modes: still visual images, video, audio, gestural, spatial, or linguistic. Be creative. Utilize a format amenable to multimodality such as a video, PowerPoint presentation, Prezi, PowToons video, Podcast, webpage, or Screencast-O-Matic rec...

    $34 Average bid
    $34 Avg Bid
    36 bids

    ...data scientist or AI/ML developer to develop a model predicting YouTube video success using multimodal AI. The project involves analyzing video content, audio, and textual data (titles, descriptions, comments). Responsibilities: - Research existing literature on multimodal AI and video popularity prediction. - Design and implement a multimodal AI model integrating visual, audio, and textual data. - Use techniques such as CLIP, BERT, and frameworks like TensorFlow or PyTorch. - Perform data preprocessing and feature extraction from YouTube metadata. - Train and evaluate the model using relevant datasets. - Document the methodology, experiments, and results. Deliverables: - A functional multimodal AI model for predicting YouTube video success. - A detailed r...

    $115 Average bid
    $115 Avg Bid
    12 bids

    I need a professional to scrape all available hearing aid manuals, FAQs, help guides, and video links from the major hearing aid manufacturers. Key Requirements: - Scrapping should include major brands: Phonak, Oticon, Widex, Resound, Starkey, Rexton, Beltone, Jabra GN, TruHearing. - I need the original user manual PDFs, and all multimedia (videos, images, multimodal article URLs) listed in an Excel spreadsheet -- columns [manufacturer, hearing aid modal, resource name, brief description or resource purpose, URL to resource] - I need this completed ASAP. Ideal Skills: - Web Scraping - Organized and detail-oriented - Delivering work within tight deadlines Please note that I require the work to be done accurately and professionally. Experience in similar projects would be a defini...

    $125 Average bid
    $125 Avg Bid
    60 bids

    I'm in need of a proficient AI specialist who can work on a Natural Language Processing project for me. The primary goal of this assignment is to generate product descriptions. Requirements: - Experience with Multimodal LLM - Proficiency in Natural Language Processing - Prior experience with product description generation - Strong understanding of AI and machine learning The ideal candidate should be able to implement a Multimodal LLM approach to the generation of product descriptions. Please provide examples of related work in your bid.

    $332 Average bid
    $332 Avg Bid
    12 bids

    My primary goal is to create a sophisticated machine translation project that focuses on Arabic to English language conversions. This is a challenging project that requires specific skill-set and in-depth knowledge in Natural Language Processing. Features I need: - Develop a multimodal machine translation model (MMT) using visuals and text to achieve accurate translation from English to Arabic and vice versa. Ideal Freelancer should have: - Proven experience in Natural Language Processing - Strong background in Machine Translation models - Ability to select and implement the most suitable Machine Translation model or API for this project, as I am open to the best possible options. If you have the necessary skills to accomplish this task, feel free to bid.

    $221 Average bid
    Urgent NDA
    $221 Avg Bid
    26 bids

    ...with deep expertise in AI, particularly in the development of multimodal semantic embeddings. The goal of this project is to create an AI system that not only understands the meaning of both speech and images but also can interact bidirectionally between these two data types. The ideal developer should have extensive experience working with TensorFlow and a solid understanding of AI principles, specifically in relation to speech and image recognition. Key Requirements: - Development of an AI system capable of understanding both speech and image data - Integration of bidirectional interaction capabilities between speech and images - Extensive experience with TensorFlow for AI model development - Proficiency in working with multimodal semantic embeddings Ideal Skills and E...

    $155 Average bid
    $155 Avg Bid
    4 bids

    we can do a task in different ways: I want to extract text from images with but I have a vulnerability error code which concerns , firestore and others I don't see any report but that's it for me prevents us from going further to remove these vulnerabilities? or maybe a multimodal llm (I used the most famous one and had very good results) but I would like an open source llm (I use lm studio but I am open to other suggestions) rather specialized in this task who can offer me this rare gem and/or who implemented this type of solution?

    $33 Average bid
    $33 Avg Bid
    14 bids

    we can do a task in different ways: I want to extract text from images with but I have a vulnerability error code which concerns , firestore and others I don't see any report but that's it for me prevents us from going further to remove these vulnerabilities? or maybe a multimodal llm (I used the most famous one and had very good results) but I would like an open source llm (I use lm studio but I am open to other suggestions) rather specialized in this task who can offer me this rare gem and/or who implemented this type of solution?

    $47 Average bid
    $47 Avg Bid
    24 bids

    ...entire set of Seamless models. Feel free to play around with the notepad. Tutorial link: Request to install the programme from GitHUB: Seamless is a family of artificial intelligence models that enable more natural and authentic communication across languages. SeamlessM4T is a massive, multilingual, multimodal machine translation model supporting around 100 languages. SeamlessM4T forms the basis of SeamlessExpressive, a model that preserves elements of prosody and voice style across languages, and SeamlessStreaming, a model that supports simultaneous translation and streaming ASR for around 100 languages. SeamlessExpressive and SeamlessStreaming have been combined into Seamless, a unified model that

    $1114 Average bid
    $1114 Avg Bid
    71 bids

    ...iterate and refine app features. Required Skills and Qualifications: Proven experience in mobile application development with a portfolio of released applications on the Android and iOS markets. Strong proficiency in programming languages such as Swift, Kotlin, and/or Dart (Flutter). Experience with AI technologies, particularly NLP models suitable for mobile platforms, and familiarity with multimodal AI incorporating inputs and outputs such as text, voice, and image data. Familiarity with caching mechanisms and developing applications for offline use. Solid understanding of the full mobile development life cycle, including automated testing and building. Knowledge of NMEA (open standard in the marine electronics industry) protocols and experience integrating with onboard syste...

    $22 / hr Average bid
    $22 / hr Avg Bid
    53 bids

    I'm currently using TensorFlow for my machine learning model. I believe the model can be significantly improved by implementing a different algorithm. In particular, I'd like to e... The original code is attached below, which is as per the research paper above. It consists a model with CNN, Attention and LSTM and now I've to improve the accuracy of this model. For this I need to change the model with adding "Deforming CNN" which is given in the research paper named "Deformable Convolutional Networks for Multimodal Human Activity Recognition Using Wearable Sensors", which I am attaching below. So add the improvements of The second paper into the original code. The model is to be then trained with Uni Mib SHAR Dataset.

    $116 Average bid
    $116 Avg Bid
    16 bids

    I'm seeking a talented professional who can assist with multimodal analysis for customer feedback. You'll be tackling the task of sentiment analysis, focusing on enhancing customer experience. Requirements include: - Strong experience in analyzing sentiments in both text and audio formats. - The ability to work proficiently in English is a necessity; no other language capabilities are required for this project. - A record of successfully improving customer experiences through insightful analysis would be advantageous. Ultimately, the goal is to transform our customer feedback, allowing us to refine the customer experience. If you come equipped with the knowledge and skills to convert raw customer feedback (text and audio) into insights, this is an ideal project for y...

    $98 Average bid
    $98 Avg Bid
    19 bids

    I run an innovative automotive shipping business specializing in land, air, and sea transportation. I'm in need of a distinctive and compelling logo to reflect the multifaceted nature of our services. While I didn't specify any preferred colors, I encourage your creativity to shine and propose a color scheme that you believe captures the essence of a multimodal transportation outfit. The ideal freelancer for this project should: - Have significant experience in custom logo design - Possess an understanding of the transportation / shipping industry - Demonstrate ability in color theory and its implications - Be able to present a portfolio with an array of design styles - Exhibit impressive creativity and originality while sticking to professional standards. Remember t...

    $79 Average bid
    $79 Avg Bid
    159 bids

    I am seeking a creative and experienced freelancer who can help me create presentations that are visually engaging and informative. The content delivery must be a combination of pictures, illustrations, words, and graphs to effectively relay information. Key Responsibilities: - Create a rich tapestry of data and insights through a combination of images, illustrations, text, and graphical data. - Leverage different types of illustrations as appropriate, priority is given to those that can capture people's attention. - Utilize data from spreadsheets to create compelling graphs as part of the presentation. Ideal Experience And Skills: - Proven experience in creating engaging presentations - Strong data visualization and infographic design skills - Ability to see the larger picture ...

    $24 / hr Average bid
    $24 / hr Avg Bid
    80 bids

    ...EVALUATION Participants will be evaluated based on the performance of their models in understanding human input and generating appropriate responses. Evaluation metrics can include relevance, coherence, and potentially user satisfaction through crowdsourced assessments. ADDITIONAL POINTS AWARDED To make the competition more engaging and challenging, consider incorporating the following elements: • Multimodal Inputs: Participants can explore incorporating additional modalities, such as images or audio, to enhance the understanding of human input. • Ethical Considerations: Participants should address ethical considerations, such as bias mitigation and fairness, in their models. WINNER Competition winner will receive upto $1000 and a chance to work on a similar project o...

    $1001 Average bid
    Featured Guaranteed Top Contest
    $1001
    34 entries

    ...fellowship program, knows and is experienced in masterfully and safely blocking those nerves and transmissions of pain diagnostically and therapeutically to either free you or manage your pain so you can enjoy life again. Along with injections and other advanced interventional procedures and techniques, Dr Morris Solis also manages pain with safe prescribing of medications with an advanced multimodal approach. We treat your pain from head to toe that includes headaches, head pains, neck, pains, facial pain, trigeminal neuralgia, TMJ, joint pain to include shoulders, facet joints, sacroiliac joints, hips, knees, ankles, toes, fingers, hands, wrist, elbows. We treat pain stemming from the entire spine from cervical spine to thoracic to lumbar to the sacrum. We treat pelvic ...

    $129 Average bid
    $129 Avg Bid
    48 bids

    ...fellowship program, knows and is experienced in masterfully and safely blocking those nerves and transmissions of pain diagnostically and therapeutically to either free you or manage your pain so you can enjoy life again. Along with injections and other advanced interventional procedures and techniques, Dr Morris Solis also manages pain with safe prescribing of medications with an advanced multimodal approach. We treat your pain from head to toe that includes headaches, head pains, neck, pains, facial pain, trigeminal neuralgia, TMJ, joint pain to include shoulders, facet joints, sacroiliac joints, hips, knees, ankles, toes, fingers, hands, wrist, elbows. We treat pain stemming from the entire spine from cervical spine to thoracic to lumbar to the sacrum. We treat pelvic ...

    $54 Average bid
    Guaranteed
    $54
    56 entries

    ...role in our groundbreaking project. This initiative focuses on the creation of an interactive affective robot, endowed with a defined gender and a distinct personality, tailored to bolster the emotional well-being of senior citizens. The ideal candidate will possess robust technical expertise in software development, cloud services, machine learning, or NLP, coupled with a flair for integrating multimodal inputs (camera, voice, and text). Additionally, an aptitude for analyzing interaction data, contributing insightful findings, and co-authoring a research paper on the project's outcomes is essential. Project Duration: 1-2 months (potential extension based on project evolution and performance) Budget: $1,500 (Fixed budget for project completion and subsequent paper publicat...

    $406 Average bid
    $406 Avg Bid
    16 bids

    I'm looking for a skilled AI developer to create a multimodal model for accurately classifying emotions, leveraging the IEMOCAP dataset. The model ultimately needs to be developed using Python, as that's the language I'm comfortable with. Experience in creating multimodal AI systems, specifically for emotion classification, is crucial for this job. Also, familiarity working with the IEMOCAP dataset will be highly advantageous. To cut it short, 1) the dataset sizs is 8K records, data (audio, text and spectogram images) are all processed and parsed. 2) the goal is to build a deep learning model using Pytorch (tensorflow is an option too) where we compare the results of each modality separately, vs Multimodal using early, join or late fusion 3) I have ...

    $489 Average bid
    $489 Avg Bid
    46 bids

    I'm in need of a skilled developer capable of crafting a Windows DLL to facilitate communication over Bluetooth, Wifi, and USB. The DLL has to offer robust functionalities including: - Data transfer - Device discovery - Connection management Your proficiency in using C++ for DLL development will be an added advantage here. This project aims to build a DLL that not only supports these communication methods comprehensively, but also ensures smooth and efficient operation. The DLL should be designed keeping in view the complexities and requirements of data transfer, device discovery, and connection management protocols for each of the specified communication channels. Essential Skills: - Proficiency in C++ - Experience in DLL development - Knowledge of Bluetooth, Wifi, and USB commun...

    $279 Average bid
    $279 Avg Bid
    6 bids

    Objectives 1. To implement and analyze the state of the art AI based algorithms & methods used for multimodal stress detection. 2. To develop framework for multimodal stress detection by devising AI based algorithm using early fusion approach. 3. To improvise the algorithm using optimization technique. 4. To validate the algorithm on real time data. More details: What are the specific modalities you would like the stress detection algorithms to consider? Facial expressions,Speech,Physiological signals,heart rate,audio ,video. Do you have any preferences for the programming language to be used for developing the algorithms? AI What level of accuracy are you expecting from the stress detection algorithms? High accuracy

    $150 Average bid
    $150 Avg Bid
    5 bids
    Trophy icon Project proposal logo
    Ended

    I am writing a project proposal centred around the use of collaborative robotics for meat processing. The project has the title "AI-powered framework for flexible and scalable multimodal cobotics in meat processing" and for short "CoBUTCHER". I am in need of a logo to make the project proposal stand out. The logo should tell something about the full project title, but be based around the short form "CoBUTCHER". - The main logo should incorporate my project's name. - A smaller, text-free, version of the logo is also required. This could be, for example, a part of the main logo. - I'm looking for a design that's bold and colorful, standing out in visual appeal. - This logo will be used across various platforms including digital media such...

    $127 Average bid
    $127
    791 entries

    I am in need of a multi-talented content creator who is comfortable creating both written and graphic content. While the project doesn't have a specified deadline, I'd appreciate a capable and dedicated professional who can deliver quality work within a reasonable timeframe. The ideal candidate should be: - Proficient in written content creation, with originality and creativity being paramount. - Adept at graphic content creation, though the specific style hasn't been defined, flexibility in design styles would be a plus. Please include samples of both your written and graphic content in your bid, and feel free to ask any questions if you need more details. Let's get creating!

    $19 Average bid
    $19 Avg Bid
    21 bids

    I require comprehensive hand carry services for the transportation of valuable items. The chosen provider must offer a combination of both air and ground transport in order to ensure a streamlined, efficient service. My project involves the transfer of my valuable items to multiple destinations, hence I need someone with vast experience in handling such delicate and high-value deliveries. This venture will benefit greatly from these skills and experience: - Proficiency in handling and securing valuable items during transit - Prior experience in managing multiple delivery destinations - Expertise in both air and ground transportation logistics - In-depth understanding of both domestic and international delivery regulations and customs rules. Please contact me with your proposals, incl...

    $583 Average bid
    $583 Avg Bid
    3 bids

    ...implement a multimodal machine translation(Using image to improve the translation quality) task using the Bridgetower vision language model, accessible at and Your task involves taking a paragraph containing multiple sentences and its corresponding image as input. Fuse these inputs using the Calixto Multimodal NMT model, found at Additionally, for English to Hindi translation, leverage the MBart model at Ensure that the entire code is executed within a Google Colab environment, with a primary focus on fine-tuning the Bridgetower model for Multimodal Machine Translation

    $387 Average bid
    $387 Avg Bid
    5 bids

    For this project, I am in need of a skilled freelancer to complete a multi-modal data entry task. - TASK OVERVIEW: The job will entail the systematic and careful entering of both text and numerical data, as well as images. With a volume classed as medium, the job will involve the entry of between 100 and 1000 pieces of data. - SOURCE FORMAT: The data will be sourced online. It's crucial that the selected freelancer has a strong internet connection and is familiar with navigating various online sources. - IDEAL SKILLS AND EXPERIENCE: Proficiency in data entry is key, with previous experience with multi-modal (text, numbers, images) data entry being highly advantageous. Having a keen attention to detail and the ability to navigate online sources effectively is crucial. Experience i...

    $10 / hr Average bid
    $10 / hr Avg Bid
    23 bids

    For this project, I am in need of a skilled freelancer to complete a multi-modal data entry task. - TASK OVERVIEW: The job will entail the systematic and careful entering of both text and numerical data, as well as images. With a volume classed as medium, the job will involve the entry of between 100 and 1000 pieces of data. - SOURCE FORMAT: The data will be sourced online. It's crucial that the selected freelancer has a strong internet connection and is familiar with navigating various online sources. - IDEAL SKILLS AND EXPERIENCE: Proficiency in data entry is key, with previous experience with multi-modal (text, numbers, images) data entry being highly advantageous. Having a keen attention to detail and the ability to navigate online sources effectively is crucial. Experience i...

    $421 Average bid
    $421 Avg Bid
    39 bids

    We seek an expert team in conversational chatbot development to support the development of an intuitive, context-aware, and multimodal chatbot for reproductive health intervention delivery. The chatbot will be able to respond to questions in a multimodal format using prerecorded audios and videos to answer specific questions that require visual or vocal description. It will combine generative and retrieval capabilities using a pre-defined counselling botflow in 3 languages and being able to respond intelligently to local slang or incorrectly spelt words etc. It can recommend specific clinics based on users’ locations and escalate conversations for human intervention. It will be deployed on Telegram, WhatsApp, Facebook, and other web interfaces. This is an URGENT and ...

    $22 / hr Average bid
    $22 / hr Avg Bid
    17 bids

    I am looking for a freelancer who can help me with fine-tuning a multimodal model specifically for visual question answering. I have already prepared the necessary data for the fine-tuning process. The main objective of this project is to increase the accuracy of the model. Skills and experience needed for this job include: - Strong understanding of multimodal models and their fine-tuning process - Proficiency in visual question answering techniques - Knowledge of deep learning frameworks and libraries - Experience in data preparation and cleaning - Ability to analyze and interpret model performance metrics - Attention to detail and ability to troubleshoot and debug any issues that may arise during the fine-tuning process.

    $50 Average bid
    $50 Avg Bid
    14 bids

    Purpose put your analysis or research from your analysis essay or your argument essay into a visual medium that makes the most sense to you. You will adapt your paper into an “off-the-page” project, have a more direct involvement in your project than restricting yourself to library exploration, and create a multimodal presentation that animates your research. Directions 1) One of the main points from the analysis essay or the argument essay will still be the focal point of your project, but you may need to shift or change some of your ideas to accommodate your medium. 2) The key to your creative project is the way it appeals to your audience as a verbal or visual project. This should be an artistic endeavor, video, comic, song, podcast, or other medium of your choosing....

    $25 Average bid
    $25 Avg Bid
    30 bids