Go-OPML is a command-line tool that converts OPML (Outline Processor Markup Language) files to JSON format and optionally fetches RSS feeds for podcasts listed in the OPML file.
The official package documentation is now available on pkg.go.dev. Visit this page for detailed API documentation, examples, and more information about using Go-OPML in your projects.
- Go-OPML
- Package Documentation
- Table of Contents
- Features
- Process Overview
- Pain Points Solved
- Getting Started
- Installation and Usage
- Usage
- Project Structure
- Cross-Platform Compatibility
- Examples
- Output
- Troubleshooting
- Performance Showcase
- Technologies Used
- Dependencies and Modules
- Author
- Contributing
- Acknowledgements
- License
- Convert OPML files to JSON format
- Fetch RSS feeds for podcasts (optional)
- Concurrent RSS feed fetching for improved performance
- Customizable number of episodes to fetch per podcast
- Adjustable timeout for RSS feed fetching
- Control over concurrent feed fetches
The following flowchart illustrates the OPML to JSON conversion process:
graph TD
A[Start] --> B[Parse OPML File]
B --> C{Fetch RSS?}
C -->|Yes| D[Fetch RSS Feeds Concurrently]
C -->|No| E[Create JSON Structure]
D --> E
E --> F[Generate JSON Output]
F --> G[Write to File]
G --> H[End]
style A fill:#f9d71c,stroke:#333,stroke-width:2px
style H fill:#f9d71c,stroke:#333,stroke-width:2px
style C fill:#f9f,stroke:#333,stroke-width:2px
style D fill:#bbf,stroke:#333,stroke-width:2px
style E fill:#bfb,stroke:#333,stroke-width:2px
style F fill:#bfb,stroke:#333,stroke-width:2px
- The process begins by parsing the input OPML file.
- If RSS fetching is enabled, the tool concurrently fetches RSS feeds for each podcast.
- A JSON structure is created, incorporating the OPML data and fetched RSS information (if applicable).
- The JSON output is generated.
- Finally, the resulting JSON is written to the specified output file.
This streamlined process allows for efficient handling of podcast subscription data, with optional RSS feed fetching for comprehensive podcast information.
Go-OPML addresses several challenges faced by podcast enthusiasts and developers working with podcast data:
-
OPML to JSON Conversion: Many podcast applications use OPML for importing and exporting podcast subscriptions. However, some may prefer to work with JSON. Go-OPML simplifies this by converting OPML to JSON, a format that's easier to work with in many programming languages and environments.
-
Bulk RSS Feed Fetching: Manually fetching RSS feeds for multiple podcasts can be time-consuming. Go-OPML automates this process, allowing you to fetch data for all your podcasts in one go.
-
Concurrent Processing: Fetching multiple RSS feeds sequentially can be slow. Go-OPML uses concurrent processing to fetch multiple feeds simultaneously, significantly reducing the overall processing time.
-
Customizable Output: The tool allows you to control how many episodes to fetch per podcast and provides timeout settings, giving you flexibility in managing the amount of data and processing time.
-
Standardized Data Format: By outputting podcast and episode data in a consistent JSON format, Go-OPML makes it easier to integrate podcast data into various applications or analysis tools.
-
Command-Line Interface: Go-OPML provides a simple CLI, making it easy to integrate into scripts or other automated workflows.
-
Cross-Platform Compatibility: Built with Go, the tool can be compiled and run on various operating systems, providing a consistent experience across different platforms.
By addressing these pain points, Go-OPML streamlines the process of working with podcast subscription data, whether you're building a podcast app, analyzing podcast trends, or managing your personal podcast subscriptions.
To quickly get started with Go-OPML:
-
Download the latest release for your platform from the Releases page.
-
Extract the downloaded file.
-
Open a terminal and navigate to the extracted directory.
-
Run the following command to convert an OPML file to JSON:
./Go-OPML -input your_file.opml -output result.json
-
To also fetch RSS feeds, add the
-fetch-rss
flag:./Go-OPML -input your_file.opml -output result.json -fetch-rss
For most use cases, especially if you're integrating Go-OPML functionality into your Go projects, using it as a library is recommended:
-
In your Go project, import the package:
import "github.com/jadmadi/Go-OPML"
-
Run
go mod tidy
to add the dependency to your project:go mod tidy
This method allows you to use Go-OPML's functions directly in your code. For usage examples and API details, please refer to the package documentation.
If you need to use Go-OPML as a standalone command-line tool:
Ensure you have Go 1.16 or later, then run:
go install github.com/jadmadi/Go-OPML/cmd/go-opml@latest
This installs the latest version of the Go-OPML CLI. You can then run it by typing go-opml
in your terminal.
To install a specific version:
go install github.com/jadmadi/Go-OPML/cmd/[email protected]
For contributors or those who need more control:
-
Clone the repository:
git clone https://github.com/jadmadi/Go-OPML.git cd Go-OPML
-
Install dependencies:
go mod tidy
-
Build the application:
go build -o build/Go-OPML cmd/go-opml/main.go
For CLI installations, verify by running:
go-opml --version
This should display the version number of Go-OPML.
- If using as a library, ensure you're using the correct import path and have run
go mod tidy
. - For CLI usage, if the
go-opml
command is not found, check that your Go bin directory is in your PATH. - For any issues, please open an issue on our GitHub repository.
-
If the
go-opml
command is not found, ensure your Go bin directory is in your PATH. You can find your Go bin directory by runninggo env GOPATH
and then add/bin
to that path. -
If you encounter any issues with dependencies, you can manually install them:
go get github.com/mmcdole/gofeed
For any other installation issues, please open an issue on our GitHub repository.
Basic usage:
./build/Go-OPML -input <input_opml_file> -output <output_json_file> [options]
Here's the full list of options available (output of ./Go-OPML --help
):
Go-OPML: OPML to JSON converter with RSS feed fetching capabilities
Usage: ./Go-OPML [options]
Options:
-concurrency int
Number of concurrent RSS feed fetches (default 5)
-fetch-rss
Fetch RSS feeds and include episode information
-input string
Path to the input OPML file
-max-episodes int
Maximum number of episodes to fetch per podcast (default 10)
-output string
Path to the output JSON file (default "output.json")
-timeout duration
Timeout for fetching RSS feeds (default 30s)
Examples:
Convert OPML to JSON:
./Go-OPML -input podcasts.opml -output podcasts.json
Convert OPML to JSON and fetch RSS feeds:
./Go-OPML -input podcasts.opml -output podcasts.json -fetch-rss
Fetch RSS feeds with custom settings:
./Go-OPML -input podcasts.opml -output podcasts.json -fetch-rss -max-episodes 20 -timeout 1m -concurrency 10
The Go-OPML project is organized as follows:
go-opml/
├── build/
│ ├── Go-OPML
│ ├── Go-OPML.exe
│ └── Go-OPML-mac
├── cmd/
│ └── go-opml/
│ └── main.go
├── examples/
│ ├── sample.json
│ └── sample.opml
├── internal/
│ ├── json/
│ │ └── generator.go
│ └── opml/
│ └── parser.go
├── pkg/
│ └── rss/
│ └── fetcher.go
├── test/
│ └── parser_test.go
├── go.mod
├── go.sum
├── LICENSE
├── README.md
└── Waqf.en.md
build/
: Contains compiled binaries for different platforms.cmd/go-opml/
: Contains the main application entry point.examples/
: Holds sample OPML and JSON files for testing.internal/
: Contains packages that are internal to the application.pkg/
: Houses packages that could potentially be used by external projects.test/
: Contains test files.
Go-OPML can be compiled for different platforms from a Linux machine. Use the following commands:
-
For Windows (64-bit):
GOOS=windows GOARCH=amd64 go build -o build/Go-OPML.exe cmd/go-opml/main.go
-
For macOS (64-bit):
GOOS=darwin GOARCH=amd64 go build -o build/Go-OPML-mac cmd/go-opml/main.go
-
For Linux (64-bit):
GOOS=linux GOARCH=amd64 go build -o build/Go-OPML-linux cmd/go-opml/main.go
Here's a small sample OPML file and its resulting JSON output:
Sample OPML (sample.opml
):
<?xml version="1.0" encoding="UTF-8"?>
<opml>
<head/>
<body version="1.0">
<outline xmlUrl="https://feeds.megaphone.fm/thediaryofaceo" title="The Diary Of A CEO with Steven Bartlett"/>
<outline xmlUrl="https://feeds.simplecast.com/e_GRxR9a" title="Azeem Azhar's Exponential View"/>
<outline xmlUrl="http://feeds.harvardbusiness.org/harvardbusiness/ideacast" title="HBR IdeaCast"/>
</body>
</opml>
Command:
./Go-OPML -input sample.opml -output sample.json -fetch-rss
Resulting JSON (sample.json
):
[
{
"title": "The Diary Of A CEO with Steven Bartlett",
"url": "https://feeds.megaphone.fm/thediaryofaceo",
"episodes": [
{
"title": "The Investing Expert: We're Raising The Most Unhappy Generation In History! The Unseen Link Between Marriage & Wealth! Hard Work Doesn't Build Wealth! - Scott Galloway",
"link": "",
"publishDate": "2024-07-11T05:00:00Z",
"description": "The millionaire entrepreneur revealing Wall Street's secrets and simplifying finance for the masses\n\n Scott Galloway is a Professor of Marketing at the New York Stern School of Business and host of the 'Pivot' podcast about technology and business. He is also the best-selling author of books such as, 'The Algebra of Happiness', 'The Four', and 'The Algebra of Wealth'. \n\nIn this conversation, Scott and Steven discuss topics such as, billionaire money hacks, the 6 fundamental rules of investing, how to make $7 million from nothing, and how marriage can make you rich.\n\n..."
}
]
},
{
"title": "Azeem Azhar's Exponential View",
"url": "https://feeds.simplecast.com/e_GRxR9a",
"episodes": [
{
"title": "The Science of Making Truthful AI",
"link": "https://hbr.org/podcasts/exponential-view",
"publishDate": "2024-02-07T13:17:29Z",
"description": "Artificial Intelligence is on every business leader's agenda. How do we make sense of the fast-moving new developments in AI over the past year? Azeem Azhar returns to bring clarity to leaders who face a complicated information landscape.\n\nThis week, Azeem speaks with Richard Socher, CEO and founder of You.com, an AI chatbot search engine at the forefront of truthful and verifiable AI. They explore approaches to building AI systems that are both truthful and verifiable. The conversation sheds light on the critical breakthroughs in AI, the technical challenges of ensuring AI's reliability, and Socher's vision for the future of search.\n\n..."
}
]
},
{
"title": "HBR IdeaCast",
"url": "http://feeds.harvardbusiness.org/harvardbusiness/ideacast",
"episodes": [
{
"title": "Why We Should Pay More Attention to Departing CEOs",
"link": "https://hbr.org/podcast/2024/07/why-we-should-pay-more-attention-to-departing-ceos",
"publishDate": "2024-07-09T13:00:32Z",
"description": "When news breaks of a CEO succession, much of the attention is given to the new leader and how they will change the company. But new research shows that the leave-taking process of the outgoing chief executive is often mishandled, with negative impacts on succession and the organization. Rebecca Slan Jerusalim, an executive director at Russell Reynolds Associates, and Navio Kwok, a leadership advisor at RRA, say that boards are often surprised when a CEO gives notice, and they often make that person feel excluded during the handoff process. The researchers share stories from the front lines about CEO psychology, best practices for outgoing leaders and their boards, and broader lessons for effective transitions. Jerusalim and Kwok wrote the HBR article \"The Vital Role of the Outgoing CEO.\""
}
]
}
]
The tool generates a JSON file containing an array of podcast objects. Each podcast object includes:
title
: The title of the podcasturl
: The URL of the podcast's RSS feedepisodes
: (If RSS fetching is enabled) An array of recent episodes, each containing:title
: Episode titlelink
: Link to the episodepublishDate
: Publication date of the episodedescription
: Episode description
Here are some common issues and their solutions:
-
"command not found" error when running Go-OPML
- Ensure you're in the correct directory containing the Go-OPML executable.
- Check if the file has execute permissions. If not, run:
chmod +x Go-OPML
-
Error parsing OPML file
- Verify that your OPML file is well-formed XML.
- Ensure the file path is correct and the file is readable.
-
Timeout errors when fetching RSS feeds
- Increase the timeout duration:
-timeout 60s
- Check your internet connection.
- Increase the timeout duration:
-
Program seems to hang
- For large OPML files, the process might take some time. Use the
-concurrency
flag to speed up RSS fetching. - Ensure you have a stable internet connection.
- For large OPML files, the process might take some time. Use the
If you encounter any other issues, please open an issue on GitHub.
Go-OPML is designed for efficiency, particularly when handling multiple RSS feeds concurrently. Here's a real-world example of its performance:
- Input OPML file: Contains 19 podcast feeds
- Operation: Convert OPML to JSON and fetch RSS feeds for all podcasts
- Hardware: [Your system specifications - this information wasn't provided, so you may want to add it]
$ time ./Go-OPML -input jad.opml -output output.json -fetch-rss
Successfully processed OPML and generated JSON. Output written to output.json
real 0m3.813s
user 0m1.662s
sys 0m0.238s
- Total execution time: 3.813 seconds
- Number of feeds processed: 19
- Average time per feed: Approximately 0.2 seconds
This test demonstrates Go-OPML's ability to process a substantial number of podcast feeds quickly. In just under 4 seconds, the tool:
- Parsed the OPML file
- Fetched RSS data for 19 different podcasts
- Processed the fetched data
- Generated and wrote the output JSON file
The efficiency is largely due to Go-OPML's use of concurrent processing, allowing it to fetch and process multiple RSS feeds simultaneously. This makes Go-OPML an excellent choice for handling large OPML files with many podcast subscriptions.
Note: Performance may vary depending on factors such as internet connection speed, server response times of the RSS feeds, and the hardware specifications of the system running Go-OPML.
Go-OPML is built using the following technologies:
- Go (Golang): The primary programming language used for the project.
- XML Processing: Go's built-in
encoding/xml
package for parsing OPML files. - JSON Processing: Go's built-in
encoding/json
package for generating JSON output. - Concurrency: Go's goroutines and channels for concurrent RSS feed fetching.
- Command-line Flags: Go's
flag
package for parsing command-line arguments.
Go-OPML relies on the following external module:
- gofeed: A robust feed parser for Go. It's used to parse RSS feeds and extract podcast episode information.
To install the dependencies, run:
go get github.com/mmcdole/gofeed
- Name: Jad Madi
- GitHub: @jadmadi
- Email: [email protected]
- Website: https://madi.se
- LinkedIn: https://linkedin.com/in/hakammadi
For professional inquiries or collaborations, please contact via LinkedIn or website.
Contributions are welcome! Please feel free to submit a Pull Request.
We would like to express our gratitude to the creators and maintainers of the following project:
- gofeed: A robust feed parser for Go, which has been instrumental in enabling efficient RSS feed parsing in this project.
This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.
Additionally, this work adheres to the principles of the Waqf General Public License, aiming to make the work a perpetual charitable endowment (Waqf) for the benefit of all Muslims.