Title

A DEEP LEARNING BASED INNOVATIVE ONLINE MUSIC PRODUCTION FRAMEWORK

DOI

10.7903/ijecs.1933

Authors

Sung-Shun Weng;Hung-Chia Chen

Keywords

deep learning ; music production ; innovative operation model

Journal

International Journal of Electronic Commerce Studies

Volume and Issue / Publication Date

Vol. 13, No. 1 (2022 / 03 / 01)

Pages

1 - 32

Language

English

Abstract

In this research, we constructed an online music production process framework to explain the operation of the industry and conduct comparative analysis, competitive analysis, and innovative business model analysis for the current music industry situation. This research can provide strategic suggestions for future scholars and industry development. In this study, through the concept of the internet, we propose the framework process of a deep learning music production (DLMP) system, and each work process and the main work content of each module in the system are described. We used upstream, midstream, and downstream industry roles for comparison and used an innovative operating model to describe the overall industry transformation. Through this research, we can more clearly see a blueprint for the future development of music production and can provide operators with a competitive advantage strategy.

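Subject Classification Basic and Applied Sciences > Information Science
Social Sciences > Economics
Social Sciences > Finance and Accounting
Social Sciences > Management
References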
  1. Wiki. Frequency[Online]. Available: https://en.wikipedia.org/wiki/Frequency
  2. (2003).The Electronic Handbook.
  3. Abd El-Fattah, M. A.,Dessouky, M. I.,Diab, S. M.,Abd El-samie, F. E.(2008).Speech enhancement using an adaptive wiener filtering approach.Progress In Electromagnetics Research M
  4. BAI, J.-L.(2007).Taiwan,Business Administration, National Chengchi University.
  5. Begoli, E.,Horey, J.(2012).Design principles for effective knowledge discovery from big data.2012 Joint Working IEEE/IFIP Conference on Software Architecture and European Conference on Software Architecture
  6. Berners-Lee, T.,Fielding, R.,Frystyk, H.(1996).Hypertext transfer protocol–HTTP/1.0.Network Working Group,1-122.
  7. Berouti, M.,Schwartz, R.,Makhoul, J.(1979).Enhancement of speech corrupted by acoustic noise.ICASSP '79. IEEE International Conference on Acoustics, Speech, and Signal Processing
  8. Bhatnagar, M.,Loizou, P. C.(2001).A cross-correlation technique for enhancing speech corrupted with correlated noise.2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
  9. Boll, S.(1979).Suppression of acoustic noise in speech using spectral subtraction.IEEE Transactions on Acoustics, Speech and Signal Processing,27,113-120.
  10. Boll, S.(1979).A spectral subtraction algorithm for suppression of acoustic noise in speech.Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'79
  11. Boll, S. F.(1978).Suppression of noise in speech using the SABER method.IEEE,3,606-609.
  12. Boulanger-Lewandowski, N.,Bengio, Y.,Vincent, P.(2012).,Unpublished
  13. Braasch, J.(2011).A cybernetic model approach for free jazz improvisations.Kybernetes,40,984-994.
  14. H. Cheng. (2013). Why keys are arranged in a geometric sequence[Online]. Available: https://wenku.baidu.com/view/ca76790cb52acfc789ebc9bb.html
  15. Wikipedia contributors. (2014). Transmission Control Protocol. Wikipedia, The Free Encyclopedia.
  16. Eck, D.,Schmidhuber, J.(2002).A first look at music composition using lstm recurrent neural networks.Istituto Dalle Molle Di Studi Sull Intelligenza Artificiale,103
  17. Elman, J. L.(1990).Finding structure in time.Cognitive science,14(2),179-211.
  18. Engel, J.(2017).,Unpublished
  19. Ephraim, Y.,Cohen, I.(2006).Recent advancements in speech enhancement.The electrical engineering handbook
  20. Ephraim, Y.(1992).Statistical-model-based speech enhancement systems.Proceedings of the IEEE
  21. Gong, Y.(1995).Speech recognition in noisy environments: A survey.Speech Communication,16,261-291.
  22. Good, M.(2001).Musicxml for notation and analysis.The Virtual Score: Representation, Retrieval, Restoration,12,113-124.
  23. Goralski, W.(2017).The illustrated network: how TCP/IP works in a modern network.Morgan Kaufmann.
  24. Graham, G.,Burnes, B.,Lewis, G. J.,Langer, J.(2004).The transformation of the music industry supply chain: A major label perspective.International Journal of Operations & Production Management,24,1087-1103.
  25. Hild, H.,Feulner, J.,Menzel, W.(1992).Harmonet: A neural net for harmonizing chorales in the style of JS Bach.Advances in neural information processing systems
  26. Hochreiter, S.,Schmidhuber, J.(1997).Long short-term memory.Neural computation,9(8),1735-1780.
  27. Hörnel, D.,Degenhardt, P.(1997).,Unpublished
  28. Hörnel, D.,Menzel, W.(1998).Learning musical structure and style with neural networks.Computer Music Journal,22(4),44-62.
  29. Jen, S.-R.,Guo, Q.-t.,Wang, S.-p.,Yang, Y.-r.,Yan, C.-h.,Zhou, G.-j.(2008).Introduction and Application of Multimedia.Taipei:Flag Publishing.
  30. W. Joe. (2005). Note names, MIDI numbers and frequencies[Online]. Available: https://newt.phys.unsw.edu.au/jw/notes.html
  31. Jordan, M. I.(1997).Serial order: A parallel distributed processing approach.Advances in psychology,121,471-495.
  32. Kamath, S.,Loizou, P.(2002).A multi-band spectral subtraction method for enhancing speech corrupted by colored noise.Proceedings of International Conference on Acoustics, Speech, and Signal Processing,Orlando, USA.
  33. Kirke, A.,Miranda, E. R.(2009).A survey of computer systems for expressive music performance.ACM Computing Surveys,42,1-41.
  34. Kogut, B.(1985).Designing global strategies: Profiting from operational flexibility.Sloan Management Review,27,27-38.
  35. T. Kubo. (2017). Next Music Production by Google Magenta[Online]. Available: https://www.slideshare.net/takahirokubo7792/tech-circle-23-next-music-productionby-google-magenta
  36. Lawrence, S.,Giles, C. L.,Tsoi, A. C.,Back, A. D.(1997).Face recognition: A convolutional neural-network approach.IEEE transactions on neural networks,8(1),98-113.
  37. LeCun, Y.,Bengio, Y.,Hinton, G.(2015).Deep learning.Nature,521(7553),436-444.
  38. Lee, I.-H.(2006).Department of Music, National Taipei University of Education.
  39. Liu, L.-w.(2009).Taiwan,Information Management, National Taiwan University of Science and Technology.
  40. Liu, Y.-T.(2012).Taiwan,Electrical Engineering, National Dong Hwa University.
  41. Lo, Y.-h.(2009).Taiwan,Music, National Taiwan Normal University.
  42. Lockwood, P.,Boudy, J.(1992).Experiments with a nonlinear spectral subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars.Speech communication,11(2-3),215-228.
  43. Lu, S.-H.(2012).Taiwan,Computer Science and Information Engineering, Shu-Te University.
  44. Malik, I.(2017).DEPARTMENT OF COMPUTER SCIENCE, University of Bristol.
  45. Meisel, J. B.,Sullivan, T. S.(2002).The impact of the internet on the law and economics of the music industry.info,4,16-22.
  46. Mitchell, D.,Coles, C.(2003).The ultimate competitive advantage of continuing business model innovation.Journal of Business Strategy,24,15-21.
  47. Mozer, M. C.(1994).Neural network music composition by prediction: Exploring the benefits of psychoacoustic constraints and multi-scale processing.Connection Science,6(2-3),247-280.
  48. Non-member, S. O.,Shimamura, T.(2001).Reinforced spectral subtraction method to enhance speech signal.Evaluation
  49. Osterwalder, A.,Pigneur, Y.(2004).An ontology for e-business models.Value creation from e-business models
  50. Osterwalder, A.,Pigneur, Y.(2005).Clarifying business models: origins, present, and future of the concept.Communications of the Association for Information Systems,15,1-125.
  51. Ott, J.,Chesterfield, J.,Schooler, E.(2010).RTP control protocol (RTCP) extensions for single-source multicast sessions with unicast feedback.Request for Comments (RFC) 5760,1-66.
  52. Pateli, A. G.,Giaglis, G. M.(2005).Technology innovation‐induced business model change: a contingency approach.Journal of Organizational Change Management,18,167-183.
  53. Porter, M.(1985).Creating and sustaining superior performance. Competitive Advantage.NY:Free Press.
  54. Preston, P.,Rogers, J.(2011).Social networks, legal innovations and the “new” music industry.info,13,8-19.
  55. Schmidhuber, J.(2015).Deep learning in neural networks: An overview.Neural networks,61,85-117.
  56. Schulzrinne, H.,Casner, S.,Frederick, R.,Jacobson, V.(2003).RTP: a transport protocol for real-time applications.Request for Comments (RFC) 3550,1-89.
  57. Schulzrinne, H.,Rao, A.,Lanphier, R.(1998).Real Time Streaming Protocol (RTSP).Request for Comments (RFC) 2326.
  58. Schuster, M.,Paliwal, K. K.(1997).Bidirectional recurrent neural networks.IEEE Transactions on Signal Processing,45(11),2673-2681.
  59. Sim, B. L.,Tong, Y. C.,Chang, J. S.,Tan, C. T.(1998).A parametric formulation of the generalized spectral subtraction method.IEEE Transactions on Speech and Audio Processing,6,328-336.
  60. Sovka, P.,Pollak, P.,Kybic, J.(1996).Extended spectral subtraction.European Signal Processing Conference (EUSIPCO--96)
  61. Sturm, B. L.(2019).Machine learning research that matters for music creation: A case study.Journal of New Music Research,48(1),36-55.
  62. Sutskever, I.,Hinton, G. E.,Taylor, G. W.(2009).The recurrent temporal restricted boltzmann machine.Advances in Neural Information Processing Systems
  63. Swatman, P. M. C.,Krueger, C.,van der Beek, K.(2006).The changing digital content landscape: An evaluation of e‐business model development in European online news and music.Internet Research,16,53-80.
  64. Tu, T.-H.(2010).Taiwan,Electrical Engineering, Southern Taiwan University of Science and Technology.
  65. Upadhyay, N.,Karmakar, A.(2013).Spectral subtractive-type algorithms for enhancement of noisy speech: an integrative review.International Journal of Image, Graphics & Signal Processing,5(11)
  66. Viglianti, R.(2007).Musicxml: an xml based approach to musicological analysis.Digital Humanities 2007: Conference Abstracts
  67. Virag, N.(1999).Single channel speech enhancement based on masking properties of the human auditory system.IEEE Transactions on Speech and Audio Processing,7,126-137.
  68. W. Wcat. (2011). Asio, Asio 4All, Ks, Was Api[Online]. Available: http://www.360doc.com/content/11/0910/12/7558399_147244236.shtml
  69. Weis, A. H.(2010).Commercialization of the internet.Internet Research,20,420-435.
  70. Weng, S.-S.,Chen, H.-C.(2020).Exploring the role of deep learning technology in the sustainable development of the music production industry.Sustainability,12(2),625.
  71. Williams, D. B.,Webster, P. R.(2008).Experiencing Music Technology.USA:Schirmer Cengage Learning.
  72. Y, Wan-jun(2014).From midi to musicxml-the development of computer music score information exchange format.Entertainment Technology,45-49.
  73. Yeh, H. T.,Chiou, J. S.,Zhou, T. J.(2013).A karaoke system with real-time media merging and sharing functions for a cloud-computing-integrated mobile device.Mathematical Problems in Engineering,2013