Build a Generative AI based Automated Image Captioning and Visual QnA Engine

Learning Objectives: After completing this course you will,

  • Understand the basics of Large Language Text/Image models.
  • Familiarize with the architecture of Salesforce’s BLIP model.
  • Learn how you can use this model for automated Image Captioning & Visual Question Answering in your local machine.

Complete this course in 3 easy steps to earn your certificate!

STEP 1 : Watch the below self-guided tutorial.
STEP 2 : Practice as you watch the video by installing and working with the kandi 1-click solution kit.
STEP 3 : Complete the assessment to receive your certificate.

STEP 1 : TUTORIAL

Watch this self guided tutorial on “AI based Automated Image Captioning and Visual QnA Engine”.


STEP 2 : PRACTICAL EXERCISE

Click the below button to access the Image Captioning and Visual QnA Engine kandi kit. This kit has all the required dependencies and resources you need to build your application.

Click on the 1-Click Installer button on the kandi kit page to install the Image Captioning and Visual QnA Engine kit. On installing and running this kit, you will have a working model that you can customize and use in your project.

kandi 1-Click Kit - Dark


STEP 3 : ASSESSMENT

Complete a short assessment and earn your certificate now. Congrats

Take Assessment

Your assessment will be reviewed and you will receive a verified certificate via email within a week.


SUPPORT

Reach out to us by replying below for any help you may need with this course.

We hope you enjoyed using kandi! Continue your learning journey with kandi Congrats

EXPLORE MORE TOPICS:

ChatGPT Voice-to-Image Generator (4)

1 Like