There are currently many artificial intelligence (AI) tools on the market that can take users’ text and images and transform them into images and videos that match the initial prompt. A new patent reveals that audio may soon be an input option for bringing your ideas to life.
As spotted by MSPowerUser, the US Patent and Trademark Office (USPTO) posted a 20-page document filed by Microsoft on April 5, 2023, and published on October 10, 2024, that details a new AI-supported system that converts live audio into images.
This system would take a live audio stream, such as one from a meeting or lecture, and convert it into a live text transcript. The transcript would then be summarized by a large language model (LLM) and fed into a text-to-image model, where an image would be generated and output on the screen, as seen in the image below.
The system would keep doing this for the duration of the audio stream, continuously generating live images. According to Microsoft, displaying images in real time can help make communication more effective, with visual aids keeping people more engaged and making concepts easier to understand.
“Displaying images related to verbally communicated information can enhance the effectiveness of communication by making it more engaging, memorable, and easier to understand,” said Microsoft.
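For readers who want to picture the flow, the three stages described in the patent (speech to transcript, transcript to summary, summary to image) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Microsoft's implementation: the function and class names are hypothetical, and the three stand-in functions merely mimic a speech recognizer, an LLM summarizer, and a text-to-image model so the example runs on its own.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Iterator


@dataclass
class Frame:
    """One generated visual plus the text it was derived from."""
    transcript: str
    summary: str
    image: str


def audio_to_images(
    chunks: Iterable[bytes],
    transcribe: Callable[[bytes], str],
    summarize: Callable[[str], str],
    generate_image: Callable[[str], str],
) -> Iterator[Frame]:
    """Run each audio chunk through the three stages, one image per chunk.

    All names here are hypothetical; the patent describes the stages,
    not this interface.
    """
    for chunk in chunks:
        transcript = transcribe(chunk)
        if not transcript.strip():
            continue  # nothing was said in this chunk, so skip it
        summary = summarize(transcript)
        yield Frame(transcript, summary, generate_image(summary))


# Stand-ins (assumptions) so the sketch runs without any real models.
def fake_transcribe(chunk: bytes) -> str:
    # Pretend the audio bytes are already UTF-8 text.
    return chunk.decode("utf-8")


def fake_summarize(text: str) -> str:
    # Pretend summarization: keep only the first five words.
    return " ".join(text.split()[:5])


def fake_generate_image(prompt: str) -> str:
    # Pretend image generation: return a placeholder string.
    return f"<image: {prompt}>"


if __name__ == "__main__":
    stream = [
        b"the quarterly sales chart shows steady growth in Europe",
        b"   ",
        b"next slide",
    ]
    for frame in audio_to_images(
        stream, fake_transcribe, fake_summarize, fake_generate_image
    ):
        print(frame.image)
```

Because the function is a generator, images are produced as chunks arrive rather than after the whole stream ends, which mirrors the continuous, live behavior the patent describes. In a real system, each stand-in would be replaced by a call to an actual model.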
If you’re wondering whether the feature will launch soon, the answer is most likely no. Filing a patent is a long way from shipping a product or feature, and many patents never reach the production phase and remain ideas.
However, if Microsoft does decide to launch this feature, it would likely live in Microsoft Teams, its video conferencing platform, and be accessible through its AI add-on, Copilot, such as Copilot Pro or Microsoft 365 Copilot for businesses.