Pages

Se afișează postările cu eticheta artificial intelligence. Afișați toate postările
Se afișează postările cu eticheta artificial intelligence. Afișați toate postările

marți, 28 octombrie 2025

News : Inpaint4Drag framework with Google Colab demo.

Inpaint4Drag introduces a novel framework that decomposes drag-based editing into pixel-space bidirectional warping and image inpainting. Our method achieves real-time warping previews (0.01s) and efficient inpainting (0.3s) at 512×512 resolution, significantly improving interaction experience while serving as an adapter for any inpainting model.

News : Lightricks introduced LTX-2

Lightricks introduced LTX-2, an open-source AI model that creates synchronized 4K video and audio in real time. It supports multi-keyframe conditioning, 3D camera logic, and various inputs like text, image, video, and audio for precise, style-consistent video generation up to 10 seconds long at 50 fps. LTX-2 runs efficiently on consumer GPUs with multiple performance modes for different creative needs

News : 13,000x faster even the fastest classical supercomputers.

Today, we’re announcing research that shows — for the first time in history — that a quantum computer can successfully run a verifiable algorithm on hardware, surpassing even the fastest classical supercomputers (13,000x faster). It can compute the structure of a molecule, and paves a path towards real-world applications.

News : UltraGen: High-Resolution Video Generation with Hierarchical Attention

UltraGen is a new AI that makes high-resolution videos, even 4K, more efficiently by using a smart attention system that balances local detail and global consistency. This can scale up video quality better than previous AI models or two-step super-resolution methods ...

News : DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion

DyPE (Dynamic Position Extrapolation) enables pre-trained diffusion transformers to generate ultra-high-resolution images far beyond their training scale. It dynamically adjusts positional encodings during denoising to match evolving frequency content—achieving faithful 4K × 4K results without retraining or extra sampling cost.

sâmbătă, 25 octombrie 2025

News : ... AI music online tools that compose, remix, and inspire !

... today I tested some free AI for music based on my tasks.
You can test these platforms not all are free. These use artificial intelligence to help users generate original music, soundscapes, and beats—whether you're a hobbyist, content creator, or professional musician.

luni, 20 octombrie 2025

News : Google AI Studio - online apps and projects.

Google AI Studio is a browser-based platform that lets you:
  • access Gemini AI models
  • apps and code development projects
  • fast prototyping and easy integration
You can see on this official website.
I tested the Fit Check application with my avatar photo, and this is the result:

News : new Runway Aleph features for video

New changes on the Runway website :
Runway Aleph is a state-of-the-art in-context video model, setting a new frontier for multi-task visual generation, with the ability to perform a wide range of edits on an input video such as adding, removing, and transforming objects, generating any angle of a scene, and modifying style and lighting, among many other tasks.

joi, 16 octombrie 2025

News : NVIDIA NeMo and NIM: AI Tools

Today I tested this and works well:
NVIDIA NeMo™ is an end-to-end platform for building and customizing enterprise-grade generative AI models that can be deployed anywhere, across cloud and data centers. NeMo Curator is a GPU-accelerated data-curation tool that improves generative AI model accuracy by processing text, image, and video data at scale for training and customization. Apply for the video curation early access program below.
NVIDIA is revolutionizing the way developers build and deploy generative AI with two powerful platforms: NeMo and NIM.NeMo offers a modular framework for customizing large language models, multimodal agents, and speech systems, while NIM delivers optimized microservices for fast, scalable inference across any NVIDIA-powered infrastructure.
Whether you're building enterprise-grade AI or experimenting with cutting-edge models, these tools provide the flexibility, performance, and simplicity needed to accelerate innovation. Explore NeMo's capabilities on the NeMo Developer Portal and test NIM APIs directly via NVIDIA Build.

duminică, 12 octombrie 2025

News : Copilot and poetry - 12.10.2025 .

I spend some time with copilot and this is the result:
Upon a map of flick’ring light,
Where firewalls rise like rampart height,
Two wardens clad in ciphered mail
Did ride through circuits worn and frail.
One bore a sigil, firm and bright,
A seal of truth in cryptic rite;
The other held a lance of scan,
To pierce the veil of shadowed man.
Beneath their steeds, the troglodytes
Did crawl through folders lost to blight,
And relics, cursed with ancient code,
Lay buried deep in data’s road.
Above, the flags of packets torn
Did flutter like a ghost forlorn,
And whispers from the RAM-bound vale
Spoke of passwords grown cold and pale.
Yet lo — within a server’s keep,
Where silence coils and shadows creep,
A wraith did stir, with fingers thin,
A hacker veiled in digital sin.
No helm he wore, nor armor true,
But bore a charm of twisted hue:
An app disguised, a spell unclean,
A script of chaos, sharp and keen.
The wardens felt his crooked trace,
Not by scent, but by misplace —
By angles bent, by logs unround,
By logic’s cry, a broken sound.
They rode unto the cache-bound gate,
Where silence held the hand of fate,
And with a gesture, swift and grim,
They cast the wraith from system’s rim.
No clash of swords, no cry of war,
Just order, woven evermore —
In realms where not all souls are brave,
And some but seek a silent grave.

duminică, 21 septembrie 2025

marți, 19 august 2025

News : GPT-5 vs Sonnet-4: Side-by-Side on Real Coding Tasks on Theia I.D.E.

The Theia IDE is a modern IDE for cloud and desktop built on the Theia Platform.
The Theia Platform is a framework for building custom, tailored cloud & desktop IDEs.
... another video from the EclipseSource - official channel.

vineri, 15 august 2025

News : perchance website - test , development note ...

I tested today the perchance with empty data and give me one result.
Also, the development team say on this development note on webpage - 8 august 2025:
Image Gen Update (August 8th)
I'm in the middle of updating the text gen model (used for story/chat/etc) and it turns out it's going to cost a bit too much.
I need to temporarily reduce the quality and speed of image gen so I can deploy the new text gen.
Image gen was already mediocre in terms of quality and speed, I know. And to add to the annoyance, you'll probably need to update your prompts to get back to the same styles. Sorry about that.
There may be some initial bugginess with this update - please bear with me while I fix any issues that come up. Edit: Currently there's an issue where prompts are being ignored sometimes, resulting in random/weird/nonsense images. I'm working on fixing it.
Once I'm finished with the text gen update, I'll get back to optimizing the image gen. It should be possible to get it down to a few seconds per image after a couple of months of work.
Aside: Thank you to those who don't block the advert - the AI stuff on Perchance would very literally be impossible without you. Of course, some people have no choice (e.g. network-level blocking that they don't control), and some people don't have any ads load for them due to the country they're from (e.g. sanctions, low advertising demand, etc.) or other issues like that. Either way, I'm working as hard as I can to optimize the models so Perchance can always be completely free and unlimited for everyone.
Let's see the result with empty data:

News : Skywork Matrix-Game 2.0

This is an open-source interactive world model generating minutes of playable video at 25 FPS, similar to DeepMind's Genie 3.
The project can be found on this website.
You can find the model on the Skywork - huggingface.co .
The jbilcke-hf user has a demo on the huggingface.co webspage with this AI .

joi, 14 august 2025

News : some changes about Google AI.

Today, I received one mail from google ...
Hello Cătălin George, You can now generate high-fidelity, 720p videos with native audio using Veo 3 and Veo 3 Fast, available in paid preview in the Gemini API. ...
The first test with this A.P.I. was on 21 may 2025, about the Veo 3 on Google you can read on the gopogle blog with one article on 17 july 2025.
The last changelog was on 7 august 2025 on the google developer website with new features:
allow_adult setting in Image to Video generation are now available in restricted regions.
The mail also tell me about:
Veo 3 brings your prompts to life, creating 8-second videos with native audio. It lets you generate video from a text prompt, an initial image, or a combination of both to guide the style and starting frame. It can create a wide range of visual styles and natively generate dialogue in multiple languages, as well as sound effects and ambient noise.
... and has a simple python script for the A.P.I. and genai python package.
import time

from google import genai

marți, 12 august 2025

News : Intel® Software Developer Tools 2025.2 Release.

... this released comes with faster AI inference, real-time rendering, and expanded HPC support for 3D/graphics workloads, article from the official website.
... optimized performance and productivity for AI, graphics, and accelerated compute.

News : New tools for real-world and interactive 3D simulations ...

Generative AI has rapidly moved from research labs into the hands of developers, artists, and product teams—enabling new forms of content creation for many industries, with the promise of greatly improved efficiency. But it's not so easy to get up and running, there are a lot of practical challenges: deploying large models efficiently, managing ever-changing dependencies, keeping up with hardware requirements, and maintaining reliable performance across environments ...

News : Microsoft Copilot 3D new feature.

Microsoft Copilot 3D Launch: Microsoft released Copilot 3D, a free AI tool in Copilot Labs that converts 2D images as JPG/PNG file types up to 10MB into 3D GLB models without text prompts. It's globally available on the web and useful for design, AR/VR, gaming, and 3D printing.