연구 분야: Artificial Intelligence
학회: Signal, Image and Video Processing
Piano music has long been a focus in automated music generation due to its complexity and versatility. However, optimizing neural networks for generating musically coherent, emotion-driven piano compositions remains a challenge. This study aims to develop an optimized neural network for automatic piano music generation using the adjustable woodpecker optimized malleable generative adversarial network (AW-MGAN). To achieve this, a dataset comprising a diverse collection of piano music pieces ranging from classical to modern styles was gathered. These pieces were selected based on their harmonic complexity, rhythmic diversity, and emotional expressiveness to ensure a rich source for training. In the context of multimedia signal processing, data preprocessing was conducted to reduce noise and simplify patterns within the music data while retaining essential features. Feature extraction was implemented using the Mel frequency cepstral coefficients (MFCC), a technique known for its effectiveness in capturing the essential characteristics of music. The processed data was then fed into the proposed AW-MGAN framework; the generative model was optimized through the AW optimization algorithm. This approach allows the model to adjust and refine its neural network layers dynamically, ensuring unnecessary complexity is minimized and essential features are retained. Proposed framework showed significant improvements in generating harmoniously coherent and emotionally resonant piano compositions. Research that we have contrasted with the approach and conventional technique evaluates average score (7.521), recall (95.2%), precision (90.65%), entropy (6.215). Results demonstrated measurable enhancements in both the diversity and precision of generated music, confirmed by both machine-based evaluation metrics and human listening tests.
| 발행 연도 | 2025년 |
|---|---|
| 인용수 | 0 |
| 출판 국가 | Australia, China |
| 사이트 | Springer |
| 좋아요 수 | 0 |