ComicraftAI

Design
App UI

Inspiration

We’re constantly looking for ways to express our ideas, emotions, and stories in engaging and creative formats. Whether it's capturing a funny moment from our daily lives, sharing an insightful thought, or communicating a complex idea, visual storytelling has been a powerful tool. Especially comics offer an immersive experience that combines the visual engagement of movies with the imaginative depth of literature, resonating with people of all ages. However, not everyone has the artistic skills or time to create a comic from scratch. This app was inspired by the everyday desire to share stories visually, making it possible for anyone, regardless of their background, to generate compelling comic strips with ease using AI.

What it does

AI-Powered Comic Generation: Instantly create comic strips by typing your story. The AI interprets your narrative and generates a visual representation, turning your words into engaging panels.

Style and Character Consistency: You can select the style and characters for your comic, ensuring that the visuals maintain a coherent and professional look throughout the story in multiple pages.

Customizable Panels: Choose the number of panels for your comic, whether you need a single scene or a multi-panel sequence, giving you control over the complexity of your storytelling.

Captioning and Dialogue Options: Turn on or off captioning and dialogue bubbles to control how much text accompanies your visuals, tailoring the experience to your storytelling needs.

How we built it

Backend Foundation: Built on Node.js with Express, ensuring a robust and scalable platform.

Content Generation: Utilizes OpenAI's GPT model, fine-tuned specifically for generating comic content like panel descriptions, dialogues, and captions and narrating selected style of comics.

Image Creation: Leverages Stable Diffusion (SDXL) for high-quality image generation.

Character Consistency: Ensured through LoRA (Low-Rank Adaptation) models, which are trained on specific character styles to maintain a coherent and recognizable appearance throughout the comic.

Key Innovation: The combination of GPT and LoRA models streamlines the comic creation process, enabling users to focus on storytelling while the app manages technical aspects like character consistency and visual quality.

Challenges we ran into

Researching Comic Styles: Identifying and understanding various comic styles and determining the most effective way to prompt the computer to generate visuals in those styles required extensive research and experimentation.

Character Consistency: A significant challenge was discovering and implementing methods to ensure selected character's consistent appearance across different scenes/ panels, prompts and styles.

Canva SDK Integration: Navigating and understanding the Canva SDK’s frontend to ensure successful integration with the app posed a technical challenge, requiring careful adaptation to fit the app’s requirements.

Handling NSFW Content: Addressing potential issues where AI-generated content might be flagged as NSFW. This involved developing a system to automatically generate alternate prompts and ensure appropriate content is provided to users.

What's next for ComicraftAI

Performance Optimization and Cost Efficiency:Experiment with faster generation models like Flux to reduce latency and improve real-time interaction, and explore cost-saving deployment strategies for image models and infrastructure

Panel-Specific Editing: Enable users to edit prompts and regenerate visuals for each panel separately after the initial generation, providing greater control and customization.

Enhancing Character Consistency: Perfecting character consistency and expand the library of characters to offer more options.

Expanding Styles: Introduce additional comic styles to cater to a wider range of artistic preferences and narrative needs.

Frame Grid Templates: Develop inbuilt frame grid templates to assist users in composing generated panels into their preferred style.

Multi-Character Selection and Labeling: Enable multi-character selection within a single prompt, with custom character labeling to recognize names instantly.

Custom Character and Scene Upload: Allow users to upload their own characters and scenes for reuse.

Scene and Background Preset Saving: Introduce scene or background saving features to be used as presets.

Built With

canvaappssdk
node.js
react
stablediffusion
typescript

Submitted to

Canva AI and Integrations Hackathon

Created by

I worked on the backend , integrated LLM for content generation and SDXL for image creation. I ensured compatibility with the frontend implemented with Canva Apps SDK, and deployed the application

Ankith Reddy Pati
Master's student at Northeastern University, transitioning into software engineering by learning and building cool stuff
Implemented frontend UI elements using the Canva SDK. I researched and defined key features to be integrated in the application. Also, trained SDXL for LoRA models.

Sruthi Sivasankar
Recent entrant into coding, I'm building projects to hone my skills in software development with practical, impactful applications.