Skip to content

Instantly share code, notes, and snippets.

@TechWithTy
Created May 28, 2025 04:25
Show Gist options
  • Save TechWithTy/1d10297b3248a12a4d9524ac3a9f985d to your computer and use it in GitHub Desktop.
Save TechWithTy/1d10297b3248a12a4d9524ac3a9f985d to your computer and use it in GitHub Desktop.
Kling AI Prompting Guide
Effective prompting is key to generating desired video content with Kling. Kling AI uses a structured approach to prompts, allowing for detailed control over the generated video.
**1. Core Prompt Formula**
Kling utilizes the following formula for constructing prompts:
`Prompt = Subject (Subject Description) + Subject Movement + Scene (Scene Description) + (Camera Language + Lighting + Atmosphere)`
The components in parentheses `()` are optional but can significantly enhance the video output. The most fundamental components are the **Subject, Motion (Subject Movement), and Setting (Scene)**.
**2. Breakdown of Prompt Components**
* **Subject:** The main focus of the video. This can be people, animals, plants, objects, and so on.
* **Subject Description:** Describes the subject's appearance details and body posture. This can be a list of multiple short sentences. Examples include athletic performance, hairstyle and color, clothing and accessories, facial features, and body posture.
* **Subject Movement:** Describes the subject's movement status, including stillness and motion. This should be straightforward and suitable for a 5-second video.
* **Scene:** Represents the environment in which the subject is situated, encompassing the foreground, background, and other elements.
* **Scene Description:** Describes the scene for the subject's environment. It should be concise and focused, using a few short sentences to outline the setting without overwhelming the viewer. It should be suitable for what can be displayed within a 5-second video. Examples include an indoor scene, outdoor setting, or natural scene.
* **Camera Language (Optional):** Pertains to employing various applications of the camera lens, along with transitions and edits between shots, to communicate a narrative or message and generate particular visual impacts and emotional tones. Techniques include ultra-wide angle shots, bokeh (background blur), close-ups, telephoto shots, low-angle shots, high-angle shots, aerial views, and depth of field, among others. (Note: This should be differentiated from camera motion control.)
* **Lighting (Optional):** Light and shadow are vital elements that imbue photographic works with soul. The application of light and shadow can make photos more profound and emotionally resonant, enabling users to create works with a rich sense of depth and expressive power. Techniques include ambient lighting, morning light, sunset, interplay of light and shadow, Tyndall effect, and artificial lighting.
* **Atmosphere (Optional):** Describes the atmosphere of the anticipated video footage, which can involve various elements to set the mood and tone.
**3. Enriching Prompts: An Example**
Starting with a simple prompt like "A giant panda is reading a book in a café," you can enrich it by adding more descriptive elements:
* **Adding Subject and Scene Description:** "A giant panda, wearing black-rimmed glasses, is reading a book, with the book resting on a table. On the table, there is also a cup of coffee emitting steam, and next to it is the café's window." This creates a more specific and manageable image.
* **Adding Cinematic Language, Lighting, and Atmosphere:** "Shot in medium range, with a blurred background and atmospheric lighting, a giant panda, adorned with black-rimmed glasses, is seen reading a book in a café. The book lies on a table, accompanied by a steaming cup of coffee, steaming hot, next to the cafe windows, movie-level color palette." This further enhances the texture and visual appeal of the generated video.
**4. Tips for Effective Prompting**
* Use simple words and sentence structures, avoiding overly complex language.
* Keep the visual content as simple as possible, aiming for a completion within 5 to 10 seconds.
* Using words like "Oriental mood," "China," and "Asia" can more easily generate a Chinese style and depict Chinese people.
* Current large video models are not sensitive to numbers, making it difficult to maintain consistency in counts, such as "10 puppies on the beach."
* For a split-screen scene, you can use a prompt like: "4 camera angles, representing spring, summer, autumn, and winter".
* At the current stage, it is challenging to generate complex physical movements, such as the bouncing of a ball or the trajectory of a high-altitude throw.
**(Note: The original documentation mentioned "Updating, welcome to add more" indicating this is an evolving guide.)**
**5. High-Quality Examples from Kling Creators (Illustrative Prompts)**
These examples showcase the model's capabilities:
* "A giant panda is eating hot pot with chopsticks, with the street as the background."
* "A Pikachu is sitting on a chair, drinking coffee and reading a newspaper."
* "A polar bear is playing the violin in the snow."
* "A bee with a puppy's head."
* "Morning mist, sunrise, lens flare, and a cool breeze. A young Chinese woman with exquisite facial features, her long hair blown by the wind, strands of hair scattered across her face, dressed in summer attire, with a seaside beach as the backdrop."
* "Indoor shooting, close-up, a Chinese child is eating dumplings."
* "A beautiful girl with Chinese style."
* "A Chinese little girl is holding a pink balloon and smiling happily in the playground, with a slide in the background."
* "Aerial shot, blue waves pounding against the rocks, a magnificent and magnificent scene."
* "A medieval sailing ship sailing on the sea, a foggy night, bright moonlight, and an eerie atmosphere."
* "First-person perspective, high-speed flight, symmetrical composition, rotation, countless lightning bolts amidst dark clouds, motion blur."
* "The camera zooms into a beacon tower on the Great Wall, first-person perspective, high-speed flight, symmetrical composition, motion blur, and atmospheric lighting."
* "A space fighter jet speeds through a huge sci-fi internal tunnel, rushes out of the tunnel into space, and a space battle can be seen at the end of the tunnel."
* "A racing car is racing on the surface of the moon against a space backdrop, with tilt-shift zoom effect."
* "Aerial shot of a cyberpunk city."
* "On an alien planet, the streetscape of a cyberpunk city, with futuristic buildings, the camera slowly advances forward, and there are pedestrians on the street."
* "A woman is engaged in a gunfight with someone in an alley, with a Blade Runner-style atmosphere, neon lights, and ambient lighting."
* "First-person perspective, a man driving a car on a night street with fireworks blooming ahead."
* "A circling camera shot captures a handsome young man dressed in ancient clothing, wearing white, seated by the pond with his eyes closed, meditating."
* "The back view of a woman, in a red long gown, standing on the rooftop, with buildings smoking in the distance."
**6. General Video Generation Parameters**
* **Video Length:** Kling can generate 5-second or 10-second videos.
* **Modes:**
* "Standard Mode": For quicker video production.
* "Professional Mode": For superior image quality.
* **Aspect Ratios:** Supports 16:9, 9:16, and 1:1.
By understanding and utilizing this structured prompt formula, its components, and the provided tips, users can more effectively guide Kling AI to produce videos that align with their vision. Continuous exploration and experimentation are encouraged to fully tap into Kling's potential.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment